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^ ■ ABSTRACT 

C/3 , This paper studies the relative spatial distribution of red-sequence and blue-cloud 

^ ^ ^ ■ galaxies, and their relation to the dark matter distribution in the COMBO-17 survey as 

function of scale down to z ~ 1. We measure the 2'^'^-order auto- and cross-correlation 
, functions of galaxy clustering and express the relative biasing by using aperture statis- 

^ ' tics. Also estimated is the relation between the galaxies and the dark matter distribu- 

I tion exploiting galaxy-galaxy lensing (GGL). All observables are further interpreted in 

lO i terms of a halo model. To fully explain the galaxy clustering cross-correlation function 

' with a halo model, we introduce a new parameter, R, that describes the statistical 

I correlation between numbers of red and blue galaxies within the same halo. 

V/^ ■ We find that red and blue galaxies are clearly differently clustered, a significant 

' evolution of the relative clustering with redshift is not found. There is evidence for 

OO . a scale-dependence of relative biasing: The linear relative bias factor varies slightly 

■ between b ^ 1.7 ± 0.5 and b ^ 2.2 ±0.1 on spatial scales between roughly 100 h^^kpc 

and 7/i~^Mpc, respectively. The linear correlation coefficient of galaxy number den- 
sities drops from a value near unity on large scales to r 0.6 ± 0.15. Both biasing 
' trends, the GGL and with some tension the galaxy numbers can be explained con- 

JH I sistently within a halo model. Red galaxies typically start to populate haloes with 



masses starting from > iQi^ iiO-Z h~^MQ, blue galaxies from > iQ^^-^^o.i i^-^Mq. For 
the cross-correlation function one requires a HOD variance that becomes Poisson even 
for relatively small occupancy numbers. This rules out for our samples with high con- 
fidence a "Poisson satellite" scenario, as found in semi-analytical models. We compare 
different model flavours, with and without galaxies at the halo centres, using Bayesian 
evidence. The result is inconclusive. However, red galaxies have to be concentrated 
towards the halo centre, either by a red central galaxy or by a concentration param- 
eter above that of dark matter. The value of R depends on the presence or absence 
of central galaxies: If no central galaxies or only red central galaxies are allowed, R is 
consistent with zero, whereas a positive correlation R = +0.5 ± 0.2 is needed if both 
blue and red galaxies can have central galaxies. 

Key words: galaxies: statistics - dark matter - large-scale structure of Universe - 
cosmology: theory - cosmology: observations - gravitational lensing 



1 INTRODUCTION 

Today a confusing wealth of different galaxy populations 
is known, which yet is thought to have arisen from a 
E-mail: psimon@roe.ac.uk fairly simple early Universe. Morphologically, local galax- 
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ies fall into two broad classes: early- type galaxies, with al- 
most spheroidal appearance and none or only a very small 
disk component, and late-type galaxies, with a small cen- 
tral bulge and a dominating stellar disk exhibiting different 
degrees of spiral structure and star formation. Within the 
context of the cold dark matter paradigm for cosmologica l 
structure formation (jCroft et al.ll200 ISpringel eFaLlbOOSl ) 
galaxies merge, grow and interact with the ambient inter- 
galactic medium by parti cipating in a hierarchical merging 
process l|Cole et al.ll2000l ). The ongoing research is trying 
to test whether the today's known variety of galaxies can 
indeed be explained within this paradigm. 

To trace the evolution of galaxy populations with time, 
a morphological identification of a large sample of galax- 
ies down to higher redshifts has proven to be difficult. The 
most practical solution to this problem is to exploit the bi- 
modal distribution of galaxies in a colour-magnitude dia- 
gram (CMD). In such a diagram early- and late-type galax- 
ies can roughly be separated do wn to redshifts of z ^ 1, pos- 
sibly even beyond that (see e.g. iLin et ahlbOOSl : iFaber et al.l 
|2007; Bell et al. 20q3, and references therein). The red mode 
in the CMD is the well-known colour-magnitude relation 
(CMR), or red-sequence, of early-type galaxies. The blue 
mode is often referred to as the blue cloud galaxies. To 
distinguish between a re d and a b lue galaxy population 
we proceed according to iBeU et al.l C2004), using a (rest- 
frame) U — V vs. Mv CMD and cut the galaxy sample 
along the CMR to obtain a red-sequence and blue-cloud 
sample. In adopting this division line about 80% of the se- 
lected red galaxies have morphologies earlier than or equal 
Hubble type Sa, while the blue-cloud galaxies are mainly 
late-type, star forming galaxies. A better morphological sep- 
aration of galaxies, not pursued for this paper though, may 
b e achieved by apply ing inclination corrections as discussed 
in lMaller et al.l l|2008l ') that have been tested for \aw-z galax- 
ies from SDSS and 2dF. 

The so far strongest observational clues about the emer- 
gence of the red-sequence from the blue cloud come from 
careful number counts in CMDs and estimates of the galaxy 
lumin osity functions for different redshifts. iFaber et al.l 
have found strong evidence that the red-sequence 
has been built up by a mixture of dry mergers between red- 
sequence galaxies, wet mergers between blue-cloud galaxies 
and quenching of star formation with subsequent aging of 
the stellar populations of blue-cloud galaxies. 

Another important source of information hinting to the 
nature of galaxies is their spatial distribution. For example, 
early-type galaxies are preferentially found in the cores of 
rich galaxy clusters where their fraction is about 90 per- 
cent, whereas outside of galaxy clusters about 70 perce nt of 
the field galaxies are late- type galaxies ()Dressledll980l ). As 
another example, it has also been found by modelling stel- 
lar populations of local early-types that the star-formation 
history of early-type galaxies depends on the galaxy density 
of the environment (jThomas et al.|[2005l ). 

One traditional way to study the spatial distribution of 
galaxies is to look at correlations in the galaxy distr ibution, 
in particular the two-point correlation function (|Peeblesl 
ll980l : lTotsu'ir fc Kiharall 19691 ) . Analyses revealed that galaxy 
clustering depends on the properties of the galaxy popu- 
latio n like mo rphology, col our, luminosity or spectral type 
(e.g. ICoil et al., .200a : ,Zehavi et all I2OO5I : iMadgwick et all 



|2003| . and references therein). Therefore, different galaxy 
populations are differently clustered - biased - with re- 
spect to the total matter component and with respect to 
each other. The detailed dependence of spatial clustering on 
galaxy characteristics, scale and redshift is a opportunity 
to learn more about the for mation and e volut ion history 
of galaxies, see for example IWhite et all \2m% or ISimonI 
(2005^). 

Along with this motivation, one aim of this paper is to 
measure the relative clustering of red (early-type) and blue 
(late-type) galaxies for different epochs in terms of the linear 
stochastic biasing parameters. These biasing parameters re - 
quire 2"''-order clustering statistics (|Dekel fc Lahavlll999l ). 
That parametrisation is a completely model-independent, 
albeit in a statistical sense for non-Gaussian random fields 
incomplete, measure for comparing two random distribu- 
tions. It quantifies as function of angular scale the relative 
clustering strength of two galaxy types and the correlation 
of their number densities. 

The machinery that is applied here to study galax y bias- 
ing is the ape rture statistics a s form alised in , Schneider et al.l 
(| 19981 ) and Ivan Waerbeki { 19981 ). It is convenient for 
analysing weak lensing data and, in particular, to mea- 
sure the linear stochastic galaxy bias as a function of scale 
(|Simon et al.l l2007': 'Ho ekstra et al.1 1200 2') . In order to have 
a compatible statistical measure that quantifies the relative 
bias between red and blue galaxies the formalism is slightly 
extended. 

Another aim of this paper is to give a physical 
interpretation of the relative clustering. For that pur- 
pose, we use the measurements for setting constraints 
on paramet ers of galaxies within the framework of a 
halo model (van den Bosch et al. '2007'; 'Zheng et al.' '20051: 
Berhnd & Weinberg 2002; Scoccimarro et al. 2001; Scljay 
2OOOI : iPeacock fc Smith! |2000| ). In this context, we intro- 
duce and discuss a new parameter - the correlation factor 
of the joint HOD - that regulates the likelihood to find a 
certain number of red and blue galaxies within the same 
halo. This allows us to investigate whether two galaxy pop- 
ulations avoid or att ract each o ther i nside / inside the same 
dark matter halo. In IScrantonI (I200I |2002| ) a similar mod- 
elling is carried out to explain the relative clustering of 
red and blue galaxy samples, however assuming for sim- 
plicity uncorrelated gala xy numbers inside same haloes. In 
ICoUister fc Lahavl l|2005^ also the clustering of red and blue 
galaxies was studied by looking at the projected galaxy den- 
sity profiles of groups. 

For the scope of t his analysis, the COMBO-17 Survey 
(|Woff et al.ll2004l .[ 200 ll ) offers an unique opportunity. It pro- 
vides one of the so far largest deep galaxy samples in the red- 
shift regime 0.2 ^ z ^ 1.1 covering an area of ~ 0.78 deg^, 
observed in five broad-band and twelve narrow-band filters. 
Based on the photometry, photometric redshifts of galax- 
ies brighter than i? ^ 24 mag have been derived within a 
few percent accuracy as well as absolute rest-frame lumi- 
nosities and colours. We are analysing the data from three 
COMBO-17 patches which are known as Sll, A901 and 
CDFS (also known as AXAF). 

The survey has also been designed to fit the 
requirements of gravi t ational lensing applications 
llKleinheinrich et al.1 l2006l : iBrown et aP l2003l : ICrav et all 
12002! ) . The coherent shear distortions of images of 
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background galaxies can therefore be used to infer 
the relati on between galaxy an d mat ter distribution 
as well ()Bartelmann fc Schneider] I2OOII I. We use this 
additional piece of information to further constrain 
parameters of the halo-model by cross-correlating the 
COMBO-17 galaxies with the corresponding shear cat- 
alogues ta ken from the Garching-Bonn Deep Sur vey 
(GaBoDS) l|Hetterscheidt et alJlioO^lErben et al.ll2005l ). 

The structure of this paper is as follows. Sect.[2]outlines 
the quantities that are used to measure the angular galaxy 
clustering, the aperture A/'-statistics. Sect. [3] introduces the 
COMBO-17-survey and GaBoDS which are the sources of 
the galaxy samples, red and blue galaxies, and shear cata- 
logues for this study, respectively. Sect. 4 is the place where 
we describe and present the details of our clustering anal- 
ysis and compare the results to the literature. The cosmic 
shear information is harnessed for the GGL, Sect. [5] quan- 
tifying the typical matter distribution about the galaxies in 
our COMBO-17 samples. Sect. 6 outlines the halo model 
which is then used to interpret the (relative) clustering and 
the GGL signal of the blue and red galaxy samples. In par- 
ticular, we introduce and discuss the correlation factor of 
the joint HOD of two galaxy populations. We finish with a 
summary in the last section. 

Unless stated otherwise we use as fiducial cosmology 
a ACDM model (adiabatic fluctuations) with f2ni = 0.24, 
fib =0.0416, J7A = l-r2m and J/q = ^ 100 kms"^ Mpc"^ 
with h — 0.732. The normalisation of the matter fluctua- 
tions within a sphere of radius 8 h~^Mpc at redshift zero is 
assumed to be erg = 0.76. For the spectral index of the pri- 
mordial matter power spectrum we use rig — 0.96. These 
values are consisten t with the third-year WMAP results 
jSpergel et al.ll2007l ). 



2 CLUSTERING QUANTIFIERS 
2.1 Aperture statistics 

The statistics used in this paper to quantify galaxy clus- 
tering is the so-called aperture number count statistics. It 
originally stems from the grav itational lensing literature. As 
shown in ISimon et al.l (|2007l ) the aperture number count 
statistics is useful for studying galaxy clustering even out- 
side the context of gravitational lensing. Its advantage is 
that no correction fo r the integral- constraint of the angular 
correlation function (|Peebleslll980l ) is needed. 

The aperture number count, A/'(0,&ap), measures the 
fluctuations, excluding shot-noise from discrete galaxies, of 
the galaxy number density by smoothing the density with a 
compensated filter u, i.e. J dxxu{x) = 0: 



\0 - e'\ 



(1) 



where 77(6) and fj denote the (projected) number density of 
galaxies in some direction 6 and the mean number density of 
galaxies, respectively. The variable Sap defines the smoothing 
radius of the aperture. 

One focus of our analysis is the 2"''-order statistics of 
galaxy clustering. All information on the 2"'^-order statistics 
is comprised by second moments of the aperture number 
counts as function of 6a_r,: 



Af, {e.p)Afj (Sap) ) = 27r y dienj (e) [i {le^p)]' , 

where I{x) is a filter kernel 

I (x) = I ds s u (s) Jo {s x) 
Jo 

to the angular power spectrum Pij{£) defined by 
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In this equation, fji is the mean number density of galaxies 
on the sky, for possibly different galaxy samples. We use 
Jo (a;) for the O^'-order Bessel function of the first kind and 
5u {x) for the Delta-function. The tilde on top of the galaxy 
number density fj denotes the angular Fourier transform of 
ri assuming a flat sky with Cartesian coordinates which we 
do throughout this paper: 



I d'erjie)e+"> ' ; 7?(0) = J 



(27r)2 



(5) 



For i= j, Pijii) is an auto-correlation power spectrum, a 
cross-power spectrum otherwise. 

To weigh density fluctuatio ns inside apertu res we use a 
compensated polynomial fllter (ISchneiderlll998l ) 



u (x) 
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^ -x^ 1 H(l-x) , 



(6) 



which by definition vanishes for x ^ 1; H(x) denotes the 
Heaviside step function. The filter has the effect that only 
galaxy number density fluctuations from a small range of an- 
gular scales contribute to the signal; it acts as a narrow-band 
fllter for the angular modes, £, with highest sensitivity to 
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,4 1' 



Therefore, the 2'^'^-order A/'-statistics are 



essentially a probe for the (band) power spectrum Pij{£c). 

In practice, the A/'-statistics are easily derived from the 
two- point correlatio n functions, (jj{9), of galaxy clustering 
(e.g. ISimonI |2005b') by a weighted integral. As an aside, 
this is very similar to the approach recently advocated in 
IPadmanabhan et al.l (|2007f ) which points out that weighting 
io{9) with a compensated fllter can also be useful for depro- 
jecting uj{6) to obtain the 3D-correlation function on small 
cosmological scales. 



2.2 Linear stochastic biasing parameters 

The linear stochastic biasing parameters (|Dekel fc Lahavl 
ll999l : lTegmark fc Peebl3ll998l ). expressed here in terms of 
the A/'-statistics, quantify the relative clustering of two ran- 
dom fields, which are in our case the number density of blue 
and red galaxies: 



6(fa 



\ (A/?.uc(eap)) ' 

A^rcd (Sap) Abluc (Sap ) 



y(A/?ed(eap))(AA2^,(eap) 



(7) 



(8) 



Note that the aperture number count vanishes on av- 
erage, (A/') = 0, due to the compensated filter used. 
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Galaxy samples unbiased with respect to each other have 
r(9ap) = b{9ii,p) = 1. The parameters are complete for Gaus- 
sian random fields, which is only approximately true for 
large scales but clearly wrong for non-linear scales, effective 
scale smaller than ~ 8Mpc//i. Owing to the incomplete pic- 
ture those biasing parameters convey for the non-Gaussian 
regime, one is unable to distinguish stochasticity from non- 
linearity in the relation between the two random fields. 
Therefore a bias factor r 7^ 1 can mean a stochastic scatter 
between galaxy n umber densities or a non -linear but deter- 
ministic mapping (|Frv fc Gaztanagalll993l l between number 
densities - or both. Higher-order statistics or non-Gaussian 
mod els for the cluster i ng are required to make this distinc - 
tion (|Wild et al.ll2005l : iBlantoiilbOOOl : iDekel fc Lahavll 19991 ). 

Note that the parameter r{9s.p) can be larger than unity 
because shot-noise contributions to the variances in the 
aperture galaxy number count are subtracted, or put an- 
other way, spatial shot-noise in the fiuctuation power of the 
ga laxy number density fi elds is automatic ally subtracted as 
in iGuzik fc SeliakI (|200lD or ISeliakI l|2000l ). The underlying 
assumption is that galaxies trace a general galaxy number 
density field by a Poisson sampling process (Poisson shot- 
noise), which is widely assumed in large-scale structure stud- 
ies and, in fact, in the definition of the clustering correlation 
function uj{9). 

We employ the linear stochastic biasing parameters here 
in order to quantify, without too many assumptions, the 
relative biasing of our blue and red sample as function of 
scale, Sap. A more sophisticated and physical, albeit very 
model-dependent, interpretation of the relative biasing is 
given within the framework of the halo-model, see Sect. 6. 



3 DATA SET 

This study is based on three fields: the Sll, A901 and CDFS. 
The observations of the fields were obtained with the Wide 
Field Imager (WFI) of the MPG/ESO 2.2m telescope on La 
Silla, Chile. The camera consists of eight 2 k x 4 k CCDs with 
a pixel size of 15 /xm, corresponding to a pixel scale of 0'.'238 
in the sky. The field-of-view in the sky is 34' x 33'. 

The data from two surveys, carried out with the same 
instrument, are used. We select blue and red galaxies, pos- 
sessing photometric redshifts, from the COMBO-17-survey. 
These data sets are further subdivided into four redshifts 
bins covering the range between z — 0.2 and z = 1.0. Shear 
catalogues from another survey, GaBoDS, covering the same 
patches on the sky as COMBO-17, are utilised to quantify 
the relation between the total (dark) matter density and 
the galaxy positions by using the gravitational lensing tech- 
nique. The following sections describe the details of the two 
surveys and the extracted galaxy and shear catalogues. 



3.1 Red and blue galaxy samples: COMBO-17 

The observations and data reduct ion of the COMB O-17 
survey are described in detail in IWolf et al.l (|200ll ) and 
IWolf et all l|2003i ). Overall the total survey consists of four 




Figure 2. Histogram (arbitrary normalisation) of photometric 
redshifts of galaxies with mjj ^ 24 mag in COMBO-17 (solid 
line). The mean photometric redshift is 2 = 0.68. The sample 
is split into four distinct photo-z bins. The photo-z distribution 
inside the bins is further convolved with the photo-z error of the 
individual galaxies to estimate the true redshift distribution inside 
the photo-z bins (dashed: 2 = 0.3; dashed-dotted: 2 = 0.5; dotted: 
2 = 0.7; dashed-dotted-dotted: 2 = 0.9). 



different, non-contiguous fields observed in 17 optical fil- 
terfl 

The photometric information was used to derive pho- 
tometric redshifts of galaxies with niR < 24 mag, based 
on a set of gala xy spectrum templates (see references in 
I Wolf et aLlliooi ). The quality of the estimate depends pri- 
marily on the apparent magnitude of the object. As estima- 
tor for the redshift uncertainty we use Eq. (5) of lWolf et al.l 
l|2004l ): 



: 0.007(1 + z)Vl + lQ0.8('"R-21.6mag) 



(9) 



where is the 1-a standard deviation of the object redshift. 
Fig. [2] shows the frequency distribution of the photometric 
redshifts (solid line) of the full sample. 

Based on photometry, rest-frame colours with accuracy 
5m ~ 0.1 mag and absolute luminosities with accuracies 
5m ~ 0.1 mag (0.2 mag) for redshifts z > 0.5 (~ 0.3) were 
calculated. 

Our object catalogue consists of galaxies with reliable 
photometric redshifts. Galaxies are only contained in the ob- 
ject catalogue if both spectral classification and estimation 
of the photometric redshift has been successful. Therefore 
there is a certain probability with which a galaxy of some 
absolute magnitude, redshift and template spectrum (SED) 
cannot be identified. This means that the galaxy sample 
is incomplete. The completeness of COMBO-1 7 has been 
studied using extensive Monte Carlo simulations (|Wolf et al.l 
120031 ) and has been found to be a complex function of galaxy 
type and redshift. Roughly, the completeness is about 90% 
for m_R < 23 mag and about 50% near m_R ~ 23.8 mag (blue, 
late-type galaxies) or near m_R « 23.5 mag (red, early-type 
galaxies) . 

We split the total object catalogue into four distinct 



The filters include UBVRI and 12 medium-band filters. 
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Figure 1. Rest-frame U-V colour vs. absolute magnitude in V-band for the Sll galaxies inside four different redshift ranges (numbers 
at bottom). Analogous colour-magnitude diagrams for the fields CDFS and A901 look similar. The galaxy sample is split into a red and 
blue-cloud population by the red-sequence division line "RS" . The steep line "CUT" is an additional cut applied to obtain comparable 
absolute magnitude-limits in all bins. 



Table 1. Number of galaxies for all redshift bins and survey fields. Only the mean redshifts of the redshift bins 0.2 ^ z < 0.4, 0.4 ^ z < 0.6, 
0.6 ^ ^ < 0.8 and 0.8 z < 1.0 are quoted. The quoted fractions are red and blue galaxies (from COMBO-17) relative to the total 
number of galaxies in the corresponding redshift bin. The comoving number densities derived from the galaxy numbers and the estimated 
comoving volume of a z-bin and field are corrected for incompleteness. The statistical errors for the individual fields (2(t) are estimates 
assuming (uncorrelated) Poisson errors for the absolute galaxy numbers; errors do hence not include cosmic variance for individual fields. 
The last column contains the number of galaxies (sources) in the shear catalogue (from GaBoDS), with 21.5 ^ R < 24.5 mag, used for 
the lensing analysis. The source galaxies have a mean redshift of 2 = 0.78. The bottom block of the table contains values averaged over 
all three fields. The errors (Icr) are now Jackknife-estimates, reflecting cosmic variance derived from the field-to-field variances. 

Field z #RED Fraction density #BLUE Fraction density #Sources 

[lO'^h^Mpc^^] [lO-^/i^Mpc-^] (GaBoDS) 



A901 0.31 297 0.22 ± 0.02 10.7 ±0.6 1045 0.78 ± 0.02 38.3 ± 1.2 17084 

0.50 397 0.24 ±0.02 6.2 ± 0.3 1264 0.76 ±0.02 20.0 ± 0.6 

0.71 413 0.17 ±0.01 3.5 ±0.2 2046 0.83 ± 0.01 18.2 ±0.4 

0.89 376 0.13 ±0.01 2.1 ±0.1 2421 0.87 ± 0.01 12.8 ± 0.3 



CDFS 0.31 118 0.16 ±0.02 4.5 ± 0.4 621 0.84 ± 0.02 22.4 ± 0.9 20487 

0.50 338 0.18 ±0.01 5.2 ± 0.3 1516 0.82 ± 0.01 23.5 ±0.6 

0.71 433 0.16 ±0.01 3.8 ± 0.2 2323 0.84 ± 0.01 20.5 ±0.4 

0.89 154 0.11 ±0.01 0.8 ±0.1 1263 0.89 ± 0.01 6.8 ± 0.2 



Sll 0.31 280 0.25 ±0.02 10.4 ± 0.6 843 0.75 ±0.02 30.3 ± 1.1 18996 

0.50 402 0.21 ±0.01 6.4 ± 0.3 1518 0.79 ±0.01 23.7 ±0.6 

0.71 393 0.18 ±0.01 3.5 ±0.2 1832 0.82 ± 0.01 16.3 ± 0.4 

0.89 367 0.14 ±0.01 2.0 ±0.1 2334 0.86 ± 0.01 12.4 ± 0.3 



COMBINED 0.31 . 0.21 ± 0.08 8.5 ±3.0 . 0.79 ±0.08 30.3 ± 6.9 

0.50 . 0.21 ±0.05 5.9 ±0.6 . 0.79 ±0.05 22.4 ± 1.8 

0.70 . 0.17 ±0.02 3.6 ±0.2 . 0.83 ± 0.02 18.3 ± 2.8 

0.89 . 0.13 ±0.03 1.6 ±0.6 . 0.87 ± 0.03 10.7 ±2.9 



photo-z bins, namely a) 0.2 ^ 2: < 0.4, b) 0.4 ^ z < 0.6, c) 
0.6^ z < 0.8 and d) 0.8 ^ z < 1.0. The mean redshifts of 
galaxies belonging to a)-d) are z = 0.3, 0.5, 0.7, 0.9, respec- 
tively. The sizes of the samples are listed in Table [T] 

In order to have a better estimate for the true red- 
shift distribution, the photo-z distribution of every bin is 
convolved with the photo-z error (Gaussian errors) of the 



individual galaxies, Eq. ([9]). The average redshift uncertain- 
ties are = 0.02,0.04,0.06,0.08 for the samples a)-d), re- 
spectively. See Fig. [2] for the resulting distributions. Obvi- 
ously, the true redshift distribution is wider than the photo- 
z distribution. Ignoring this effect would lead to a system- 
atic un der-estimation of th e galaxy clustering amplitude by 
~ 20% (|Brown et al.ll2008h . 
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The galaxy samples are further subdivided by applying 
a cut in the rest-frame U — V ys. Mv — 5 log^Q h CMD 
(Johnson filter) along the line 

{U-V)(Mv,z)= (10) 
1.15 - 0.31 z - Qm{Mv - 5 logjo ft + 20) . 

This model-inde pendent, empirical cut has been chosen by 
iBell et al] (|2004l ) to study red galaxies near the galaxy red- 
sequence. It slices the bimodal distribution of galaxies in 
the CMD between the two modes. Galaxies redder than 
{U — V){Mv,z) are dubbed "red galaxies", "blue galax- 
ies" otherwise. For the redshifts considered here, most of the 
red galaxies selected this way are morphologically early-type 
with dominant old stellar populations, while blue galaxies 
are mainly late-type, star-forming galaxies. 

The redshift dependence of the CMR zero-point, in 
Eq. (|10|l . was fitted to match the colour evolution of 
COMBO-17 early-type galaxies and to be consistent with 
the SDSS CMR zero-point at low redshift. From the view- 
point of the COMBO-17 early-types the zero-point for red- 
shifts 2 < 0.2 is slightly too low, giving a small contamina- 
tion of the red sample with blue cloud galaxies. Since we 
consider only galaxies starting from z ^ 0.2, this contami- 
nation is negligible for this study, though. 

As we took only galaxies with reliable photometric red- 
shifts, we have as further selection rule ma ^ 24 mag. The 
distribution of our samples in a rest-frame CMD is plotted 
in Fig. [T] Obviously, in the lowest redshift bin CMD galax- 
ies populate faint regions in the diagram that are excluded 
in the other redshift bins due to the survey flux- limit. We 
estimate that in the three deeper redshift bins galaxies have 
roughly to be brighter than (rest-frame) 

Mv-51ogio/iRi -17mag- (U- V) (11) 

in order to be included (see steep black lines in Fig. [l|. 
To acquire comparable galaxy samples at all redshifts we 
artificially apply this limit as cut to all redshift bins. After 
applying this cut, the galaxy samples of all redshift bins 
have comparable absolute rest-frame Mv luminosities. The 
red sample has an average of {Mv) ~ —20.0 ±0.1 mag, the 
blue sample (Mv) = -18.8 ± 0.3 mag (for h = 1). 

In contrast to the red-sequence cut, this luminosity cut 
does not take into account the colour/luminosity evolution 
of the samples but is placed at the same position of the rest- 
frame CMD-diagram at all redshifts. We therefore expect 
the selected galaxy populations of all redshifts not to be 
totally equivalent at the faint end. For an eas ier comparison 
with the literature, e.g. iBrown et al.l (|2008t ). Fig. [3] shows 
also the distribution of rest-frame Ms-magnitudes of the 
various samples. The samples become bluer with increasing 
redshift which is, at least partially, explained by the passive 
evolution of the stellar populations. 

We determine the absolute number density, 0tvpe, of our 
galax y samples by using the Vmax-estimator (e.g. lFried et al.l 
I2OOII ). This estimator needs to be slightly modified since 
we are not selecting the galaxies from a top-hat redshift 
range. Instead we are selecting, for every photo-z bin, galax- 
ies in redshift with a probability proportional to p{z), which 
is the aforementioned photo-z distribution convolved with 
the photo-z error; p{z) resembles only for the lowest z-bin 
roughly a top-hat selection window, see Fig. (2] Due to the 
uncertainty in redshift we also select galaxies from a red- 



shift range (volume) larger than the photo-z window would 
imply. Not taking this effect into account would mean to 
over-estimate the galaxy number density. Assuming that we 
are selecting, apart from the incompleteness e xpressed by 
the in completeness function C{z, SED, U — V, R) (|Wolf et al.l 
l2003f ). all galaxies at the redshift of maximum j)(z) , Pmax say, 
the Vmax-estimator is: 

<t>typc =5^7^; O"^(0typo) = 5Z 772 ' (12) 

V, = (13) 
o r°° civ 
/ d2p(2)— — C(2,type,)7.-V;,i?0 • (14) 

Pmax Jo dildz 

Therefore, p{z) is used here to correct the incompleteness 
function C for our additional galaxy redshift selection crite- 
rion. By we denote the survey area of a COMBO- 17-patch 
which is f2 = / X (39.7arcmin) with a filling factor /, es- 
timated from the patch masks, of / = 0.56, 0.54, 0.55 for 
A901, CDFS and Sll, respectively. The estimator a"^ {4>ty-pc) 
is used for the Poisson shot-noise error of 4'typc- For a top-hat 
selection function p{z), one obtains the estimator mentioned 
in Fried et al. 

3.2 Cosmic shear data: GaBoDS 

The data reduction of GaBoDS was performed with a nearly 
fully automatic, stand-alone pipeline which we had devel- 
oped to reduce optical and near infrared images, especially 
those taken with multi-chip cameras. Since weak gravita- 
tional lensing was our main science driver, the pipeline al- 
gorithms were optimised to produce deep co-added mosaics 
from individual exposures obtained from empty field obser- 
vations. Special care was taken to achieve an accurate as- 
trometry to reduce possible artificial PSF patterns in the 
final co-added images. For the co-addition we used the pro- 
gramme ElSdri zzle. A detai l ed de scription of the pipeline 
can be found in lErben et"al] l|2005h FI 

The shape of galaxies is infiuenced by the anisotropic 
PSF. In order to obtain unbiased shear estimates from ob- 
served sour ce galaxy ell i pticit ies we use the so-called KSB 
algorithm (|Kaiser et al.l [l99^. For a detailed description 
of our implementation of the K SB-algorithm and ca t alogu e 
creation we refer the reader to iHetterscheidt et al.l l|2007l ). 
Our KSB-algorithm pipeline was blin d-tested with simu- 
lated data within th e STEP project l|Massev et all l2007l : 
iHevmans et al.ll2006l l. 

For the PSF-anisotropy correction we utilise stars which 
are point-like and unaffected by lensing. By using a sample 
of bright, unsaturated stars, we measure the anisotropic PSF 
with a Gaussian filter s cale matched to the si ze of the galaxy 
image to be corrected (|Hoekstra et al.l|l998h . In the case of 
the WFI@2.2m instrument the PSF of the co-added images 
varies smoothly over the total field-of-view. Therefore we 
perform a third-order two-dimensional polynomial fit to the 
PSF anisotropy with 3.5 cr-clipping as a function of position 
over the entire field-of-view. With this fit it is possible to 
estimate the PSF anisotropy at the position of the galaxies. 

^ The THELI pipeline is freely available under 
f tp: //f tp. ing. iac . es/mischa/THELI . 
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Figure 3. Frequency distribution of (rest-frame) B-band magnitudes, separated for red (left) and blue (right) galaxies, for the four 
different photo-z bins (dashed lines; z = 0.3; dashed-dotted; z = 0.5; dotted: z = 0.7; dashed-dotted-dotted: z = 0.9). The corresponding 
mean magnitudes and standard deviations in order of increasing mean redshifts are {h = 1): (Mb) = —19.20 ± 0.94mag, —19.56 ± 
0.91 mag, -20.06 ± 0.84 mag, -20.49 ± 0.68 mag (red) and (Mb) = -18.34 ± 0.88 mag, -18.96 ± 0.94 mag, -19.46 ± 0.83 mag, -19.83 ± 
0.69 mag (blue). 



All objects for which problems concerning the deter- 
mination of shape or centroid position occur are rejected 
(e.g. objects near the border, with negative total flux, with 
negative semi major and/or semi major axis). In addition, 
we only use those objects with a half-light radius which is 
larger than that measured for stars and an ellipticity (af- 
ter PSF correction) of less than 1.0. Additionally, we only 
use those galaxies with a signal-to-noise ratio larger than 
five, since a comparison of ground- and space-based data 
showed that galaxies with a lower S/N do not contain any 
shear informat ion (Hetterscheidt e t al., in prep.). We adopt 
the sc heme in lErben et al.l (1200 ll ) and iHetterscheidt et al.l 
l|2005l ) to estimate uncertainties for each galaxy ellipticities. 
In the following lensing analysis galaxies are weighted with 
the inverse of the square of the estimates variances. 

A powerful way to reveal possible systematic errors in 
the PSF-correction is the application of the aperture mass 
statistics as it provides an unambiguous splitting of E- and 
B-modes: 
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The Map-statistics quantify the fluctuations of the shear 
signal (E-mode: plus sign on r.h.s, B-mode: minus sign) 
within an aperture of radius Sap. They can be obtained 
by t ransforming the shear-she ar correlation functions ^± 



(e.g. Hetterscheidt et al] |2007| ) . We use T± as derived in 



Figure 4. Various measurements of the aperture mass statis- 
tics of the three COMBO-17-fields in H-band {r^ 0.75 deg^). All 
galaxies in the magnitude range R G [21.5, 24.5] are used for the 
cosmic shear analysis. The effective number density of galaxies 
is 16 arcmin"^. The error-bars denote the field-to-field variance 
between the three fields. Upper panel: E- and B-mode decompo- 
sition of the Map-signal. Lower panel: Cross-correlation between 
uncorrected stars- and corrected galaxies for E- and B-modes. 



ISchneider et all (|2002| )." The presence of non-vanishing B- |2004| : iHevmans et a l 



1 



modes is a good indicator for systematics arising, for in- 
stance, from an imperfect anisotropy correction. 

However, there are several possible astronomical 
sources of B-modes, like the i ntrinsic alignment of galax - 
ies (e.g. iHeavens et al.l l2000l : iHevmans fc Hea vens' '2003'), 
the intrinsic shape-shear correlation (e.g. iHirata fc Seliala 



2006) and th e redshift clustering 

of source gala x ies ( Schneid er et al.l ^002). Furthermore, 
iKilbinger et al.l l|2006l) found in their work a mixing of E- 
and B-modes due to a cut off in ^± on small angular scales. 
However, all those B-mode sources are expected to be much 
smaller than the statistical errors of the three fields and are 
therefore irrelevant in the following analyses. 
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A further method to check for systematics is the cross- 
correlation between P SF-uncorrected st ars and anisotropy 
corrected galaxies (e.g. lBacon et al]l2003l ). For that purpose, 
shear-shear cross-correlations, between star-ellipticities 
and galaxy-ellipticities are computed and transformed ac- 
cording to Eq. (|15p . We denote the thereby obtained A/ap- 
variances as Afcross;E and Afcross;B for the E- and B-modes, 
respectively. 

In Fig. |4] the average E- and B-mode signal and the 
average signal of the cross-correlation between anisotropy- 
corrected stars and uncorrected galaxies, Afcross;E and 
Afcross;B, are display ed (further details on M cro33:E and 
Mcross;B are given in iHetterscheidt et al.llioOTi '). The mea- 
sured B-mode signal is consistent with zero within the 1 a- 
range for 60 > 2' , and the cross-correlation between uncor- 
rected stars and corrected galaxies, Afcross;B, is consistent 
with, or close to zero. Hence the B-mode signal does not in- 
dicate an imperfect anisotropy correction. Additionally, the 
cross-correlation signal, Mcross;E is consistent with zero. 

Taking the B-modes and the cross-correlation signals 
A/cross;E and AfcrossiB iuto accouut we conclude that the in- 
fluence of systematics on the calculated E-mode signal is 
negligible compared to statistical errors. 



4 RELATIVE BIASING OF RED AND BLUE 
GALAXIES 

4.1 Combining measurements 

We outline here how measurements of the same quantity 
in the A^'p = 3 different fields were combined, and how the 
covariance of the combined value was estimated. 

The quantities estimated from the data, as function of 
galaxy-galaxy separation on the sky, are the angular cluster- 
ing of the galaxy samples, in terms of the aperture statistics 
(auto- and cross-correlations), and later on the mean tan- 
gential shear around the galaxies. The measurements are 
binned into five logarithmic angular bins and compiled as a 
data vector di , where i = 1 ... 3 is an index for the survey 
field (either A901, Sll or CDFS). 

A commonly applied technique for combining all mea- 
surements and estimating the covariance of the combina- 
tion is by looking at the field-to-field variance of di (e.g. 
IHetterscheidt et al.ll2007h . Applying this technique to mere 
thr ee fields, however, p oses problems that are unsolved so 
far iHartlap et al jioor) and would bias the final results. We 
therefore use a different approach here. 

For each field i individually, the measurement is re- 
peated A^b = 100 times on bootstrapped data, which is ac- 
quired by randomly drawing galaxies (with replacement) 
from the original samples. The size of the bootstrap samples 
equals the the size of the original sample. For a recent pa- 
per on_this_and_related statistical tools for error estimation 
see lNorberg et aP l|2008l ). which points bootstrapping out as 
appropriate, albeit conservative (errors are overestimated by 
~ 40%), method for uncertainties in two-point correlation 
functions.. The data vector of the j-th bootstrapped sample 
of the i-th field is denoted by dij . The variance of di among 
the bootstrap samples yields an estimate for the covariance 
of the statistical errors in di due to galaxy shot noise: 



1 

C» = ]\j 5Z ('^'^ ~ ('^■'J ' 



(16) 



The most likely value of a combined d of all fields, con- 
strained by all individual di and their covariances d, is 

-1 



(17) 



obtained by finding the minimum of d in the negative log- 
likelihood (assuming Gaussian errors): 



dd 



= ■ x\d) = Y, {d, - df [Zi\-^ (d. - d) . (18) 



This is the generalisation of the well-known rule to combine 
measurements by inversely weighting with their statistical 
error. We consider the assumption of Gaussian statistics for 
the likelihood function as valid approximation for the fol- 
lowing reasons: 

• The statistical errors of the angular clustering estima- 
tor, outlined below, are known to be Poisson, hence closely 
Gauss ian for a not too smal l number of galaxy pairs inside 
a bin (|Landv fc Szalavlll993l ). Since we linearly combine dif- 
ferent, little correlated, angular bins of the clustering esti- 
mator to obtain the A/'-statistics, the A/'-estimates are even 
more Gaussianly distributed according to the central limit 
theorem of statistics. 

• As for GGL, the complex ellipticities of the lensed 
sources obey roughly a bivariate Gaussian distribution 
which makes the estimate for GGL inside an angular bin 
also approximately Gaussianly distributed. 

Even if the noise in the data does not obey Gaussian statis- 
tics, the l.h.s. estimator Eq. (|17|) yields an unbiased however 
not optimal (maximum likelihood) estimate of d, with C as 
covariance, as any weighted average of unbiased estimates 

di is itself an unbiased esti mator. 

As pointed out by Hartlap et al.l (|2007| ) taking the in- 
verse of the (bootstrapped) Ci gives a biased estimate of the 
inverse. To obtain an unbiased estimator of the inverse we 
multiply Ci in Eq. ([T5)) by the factor: 



(19) 



where p is the size of the vector d (here: p — 15, five an- 
gular bins for (A/Jid), (J^hiue), (A/'redA/'biuo) , respectively; for 
the halo-model fit later on we will extend di by ten more 
components comprising the GGL signal of the samples) . 
The covariance of the combined mean d is simply 



{dd') - = C 



(20) 



This covariance does not contain an estimate of the cosmic 
variance, though, because it is solely based on bootstrap- 
ping using individual fields (galaxy shot-noise) and does not 
include a field-to-field variance between the fields. 

This can be seen by considering a toy example which 
has just one bin, di, for the field data vectors and equal 
(co)variance for all fields, d = say. The above equations 
tell us for that case that d is the arithmetic mean of all di, 
and the combined variance is C = a^N-^. Based on this, if 
we had an infinite number of galaxies within each field, i.e. 
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Figure 5. Correlation of statistical errors of the aperture number 
count statistics - {^hl-ael' (MilueA/'rod ) and {M'^^^ for five aper- 
ture radii each - and the GGL signal - (7t) for the blue and red 
sample — for a mean redshift of 2 = 0.7 as obtained from boot- 
strapping the data. The correlation matrix for the other redshift 
bins look similar. 



bootstrapped cr 0, we would expect a "perfect" measure- 
ment with C = 0. Hence, C completely ignores the possibil- 
ity that di 7^ dj for i ^ j even with a hypothetically infinite 
number of galaxies inside each patch (cosmic variance). 

We expect the actual uncertainty therefore to be some- 
what higher than expressed by C, especially for the larger 
separation bins (larger aperture radii) where cosmic vari- 
ance errors are becoming larger with respect to shot-noise 
errors. The fact that the bootstrap variances overestimate 
the covariance presumably compensates this deficit to some 
extend. 



4.2 Clustering of red and blue galaxies 

4.2.1 Method 

The A/'-statistics is derive d from the an gular clustering of 
the galaxy samples, io{9) ( Peebles! flQSCf ). by a linear trans- 
formation l|Simon et al.ll2007l ) 



where 

T+ {x) ^ {2nf 



ds s [I(s)]^ Jol 



(22) 



The indices i and j are used to denote the different galaxy 
samples. For estimating the angular clustering of a single 
sample, i — j, we use the standard method of counting the 



number of galaxy pair within a certain separation, namely 
pairs of galaxies from the same sample, (DD) , pairs of galax- 
ies from a random mock sample and the COMBO-sample, 
(DR) , and pairs between galaxies from the same mock sam- 
ple, {RR} iLandv fc Szalavlll993l ). The number of pairs in- 
volving random mock samples is averaged over 50 mock re- 
alisations for each correlation function. 

For the random catalogue, we assume that the com- 
pleteness of the galaxies inside a redshift is homogeneous, 
so that the only relevant parameters for the random mocks 
is the number of galaxies and the masking, which is applied 
for all galaxies equally. 

The cross-correlation function, i ^ j, is comp uted im- 
plementing the estimator (|Szapudi fc Szalavlll997f ): 



{D,D,) {D,R,) + {DjR,) 



+ 1 



(23) 



(RiRj) (RiRj) 

for which counting the number of pairs between different 
galaxy samples, Di/j, and different mock samples, Ri/j, is 
required. For example, (RiDj) denotes the number of pairs 
between galaxies from a i-mock catalogue and galaxies j 
within a certain 8 separation interval. Note that the size of 
the mock sample Rk is the same as the size of the galaxy 
sample Dk- 

Traditionally, galaxy clustering is studied using the an- 
gular correlation function w{8). For a comparison of our 
results for the two-point statistics of galaxy clustering in 
COMBO-17 with the literature, we infer the angular corre- 
lation func tion from the A/"-sta tistics by applying the method 
outlined in [Simon et al.l(|2007l ). This method allows us to be 
ignorant about the integral constraint which offsets the esti- 
mates of uj{6) obtained from the aforementioned estimators. 
We parameterise u}{9) as a simple power-law 



(24) 



^(^) = ( Y7 



Aui and 5 are constants. Moreover, we deproject w{0) in 
order to obtain an average SD-correlation function for the 
clustering of the samples, 

?(r) = " \ (25) 

after having made sure that the Limber approximation 
(Peebles 1980; Limber 1953) was vahd here (jsimon 2007); 
the constant ro is the correlation length. 

As the 2D-cor relation functions are not exactly power- 
laws (jZehavi et al.l [2004'l. the foregoing procedure will yield 
parameters for the 3D-clustering which are biased to some 
extent. We do not discuss this effect further but point out 
here that this bias could, in principle, be estimated from 
the halo-model fit, which also predicts the 3D-correlation 
function. 



{M(eap)AA,(eap)) = / dxxLj^j{e^px)T+{x) , (21) ,22 

Jo 



Results 



The combined measurements of the 2"'*-order A/'-statistics 
for blue and red galaxies can be found in Fig. |6] The 
A/'-statistics is binned between O'.l ^ 6'ap < 23' using five 
logarithmic bins. The statistical errors are somewhat cor- 
related which can be seen in Fig[5] The best fits of the 
angular clustering and 3D-clustering parameters are listed 
in Table [2] Averaging over all redshift bins, we find for 
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Figure 6. Variance of the aperture number count, (N^^^ and (A/'^j^^,), for red (filled circles) and blue (filled stars) galaxies. The straight 
lines are the best-fittin g power laws to th e measurements. The thick solid line denotes the theoretical variance for galaxies unbiased to 
the dark matter (as in lSmith et al-lboOSl) . Also plotted is the aperture number count cross-correlation, (A''rcdA/biuc), between blue and 
red galaxies (open stars). The data points are slightly shifted along the abscissa to increase the visibility. 



the red sample A^, = 0.35 ± 0.06, 5 = 0.85 ± 0.10 and 
ro = 5.5 ± 0.9 /i~^Mpc. The corresponding values of the 
blue sample are = 0.08 ± 0.02, & = 0.65 ± 0.08 and 
ro = 3.0 ± 0.4/i~^Mpc. If we combine the red and blue 
sample, we find as average over all redshifts for all galaxies 
Au: = O.lliO.Ol, 5 = 0.79±0.03 and ro = 3.8±0.3 ft'^Mpc. 

In the following, we would like to compare the best-fit 
parameters for the galaxy clustering to the results of other 
papers. Since the data sample selections between different 
surveys are in general not equal, we can only make a crude 
comparison to other results. Typical values for the cluster- 
ing of galaxies, regardless of their colour, a t low redshifts are 
6 ft" ^ Mpc and & 



ro 



4 - 6/t~^Mpc and 5 = 0.6 - 0.9 (ci. iMcCracken et al 
200g'; Hawkins ct al. '20031: [Zehavi et al.ll2005l : lNorberg et al 
2002 : Zehavi c t al. 2003)- Compared to these values our re- 



sults are compatible, although we may have somewhat lower 
values for ro. A lower clustering amplitude may be explained 
by a differen t mean luminosit y of our sample, though. See 
in particular ICoil et al.l (|2008l ) for the dependence of galaxy 
clustering parameters on absolute luminosity. 



Subdividing the galaxy sample of COMBO-17 into red 
and blue galaxies yields different clustering properties: red 
galaxies are more strongly clustered than blue galaxies and 
red galaxies have steeper slo pes 7 than blue ga l axies. Sim- 
ilar cuts have b een done in iMcCracken et al.l (|2008l ) and 
ICoil et al] (I2OO8I ) who find clustering properties in good 
agreement with our measurements up to redshifts of 2 ~ 1. 
Beyond the statistical uncertainties of our measurements we 
do not find a change in clustering of our samples as, for ex- 
ample, reported by the more accurate measurements of Coil 
et al. 



iPhletas et al.l (|2006l ) measured the clustering of red and 
blue galaxies, examining the clustering in redshift space 
within the range of 0.4 ^ z 0.8, with the same data set 
as we do and applying the same red-sequence cut. They 
quote values for the red sample which are comparable to 
our result. The blue sample is somewhat different, though, 
with a marginally higher ro = 3.65 ± 0.25 /i~^Mpc and a 
shallower 5 = 0.45 ± 0.03. We suspect that the difference 
is due to a different magnitude cut in addition to the red- 
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Table 2. The results are for Qm = 0.3, Ha = 0.7. Upper half of table: summary of results (median and 68% confidence levels) obtained 
considering values for (A/''^) out of S S [0.1', 23'] for red and blue galaxies for different mean redshifts, z. Note that the statistical errors 
of 5 and A^j are strongly anti-correlated with a correlation coefficient of about —0.6. Lower half of table: the fit for ijj{9) has been 
transformed into the 3D-galaxy clustering function ^(r) = {r/ro)~^ using Limber's equation. The clustering length tq is in units of 
/i~^Mpc; ^iMpc denotes ^(r) at a comoving length of r = 1 /i~^Mpc. Parameters with a "|<5"-suffix are fits for a fixed slope: S = 0.85, 0.65 
for red and blue, respectively. 
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sequence cut: Phleps et al. selected only galaxies brighter 
than AIb = —18, whereas our magnitude limits are colour 
dependent, Eq. (fTTj) . 



4.3 Relative linear stochastic bias 

4.3.1 Method 

The aim of this section is to constrain the relative linear 
stochastic bias of the red and blue galaxy sample using the 
measurements of the aperture number count statistics. Sim- 
ply applying the definitions, Eq. ((7)l and ((8)l, to the mea- 
surements would probably result in a biased estimate of the 
bias parameters due to the relatively large uncertainty in 
the aperture statistics. A more reliable, but also more elab- 
orate, approach consists in employing Bayesian statistics, 
see Appendix [Cl 

Our combined measurement of the aperture statistics 
for different aperture radii, 9i , is compiled inside the vector: 



d 



P ^ 



,{AAb'luc(eOit.-uo),...,&(eO 



(27) 



(26) 



The covariance of d, C, is worked out according to what 
has been outlined in Sect. 14.11 The measurement d is an 
estimate of the true, underlying aperture statistics. Let 
{■MDiuc(^i)ltruo) be the true aperture number count disper- 
sion of blue galaxieslfl For given linear stochastic bias pa- 
rameters and the true aperture number count dispersion of 
blue galaxies, 



3 Without further assumptions, the best estimate for the true 
A/'-dispersion is {J^^i^^i^)) itself. By putting constraints on the 
relation between the aperture statistics of red and blue galax- 
ies, as we are doing by constraining the parameters of the linear 
stochastic biasing, this can change. 



the expected A/'-statistics of blue and red galaxies, "fitted" 
to the data d, is: 



m{p) 



b\e^){N^iUdi)\t.^.),h\e2){N^iUe2)\u^.), 

r(6»l)&(6'l){AAbluc(6'l)ltruo),r(e2)6(e2){AAbluc(6l2)itruo),..., 
{J^bluc{ei)\truc),{J^Muc{02)\truc},...) ■ (28) 



The A/'-statistics involving the red galaxy population is ex- 
pressed in terms of the blue population statistics and the 
linear stochastic bias parameters. 

Now, the likelihood of the parameters p given the data 
d is for Gaussian errors: 



P{d\p) oc exp ( [d — m(p)]* C ^ [d — m{p) 



(29) 



The posterior likelihood, up to a constant factor, of b(6i) 
and r{6i) given d and marginalised over {Af^iuc(^i)ltruc) is 



P{b{e,),r{9^)\d) cx 

T — 1 



(30) 



P{d\p) 



The probabilities P {b{6i)) and P {r{6i)) are priors on 
the bias parameters which we chose to be fiat within 
fe(6'i) e [0,4], r(6'i) e [0, 1.3] and zero otherwise. The upper 
limits of the priors were chosen to be well above crude es- 
timates for b and r, obtained by blindly applying Eqs. (0 
and ((8]) to the data. The number of aperture angular radii 
bins is A^bin. 

The marginalised posterior likelihood P {h{9i),r{6i)\d) 
of the overall ten variables (ten bias parameters for five 
aperture radii) is most conveniently sampled employing 
the Monte-Carlo Markov Chain (MCMC) technique (e.g. 
iTereno et allbOOSl ). Especially, the marginalisation is trivial 
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Figure 7. Relative linear stochastic bias between the red and blue galaxy sample for four mean redshifts, 6(9ap) is in the left panel and 
7'(Sap) in the right right panel. The shaded area denotes the Icr-confidence region about the mean of all redshifts taken together. The 
effective proper spatial scale for an aperture radius of 10' at redshift z = 0.3,0.5,0.7,0.9 is 2.8, 3.8, 4.5, 4.8 /i^'^Mpc. The lines show the 
best-fitting models for z = 0.7 discussed later on in Sect. [6731 (solid: no central galaxies, dot-dashed: red central galaxies, dotted: red and 
blue central galaxies). The data points are slightly shifted to increase the visibility. 



within this framework. Remember that the size of the data 
vector is p = 15. 



4.3.2 Results 

The Fig. [7] shows the inferred constraints on the relative lin- 
ear stochastic bias of red and blue galaxies. Owing to rela- 
tively large remaining statistical uncertainties, which makes 
all redshift bins indistinguishable, we have combined the sig- 
nal from all redshift bins (shaded areas). 

As already seen in Fig. [S] red and blue galaxies are 
differently clustered with respect to the dark matter and 
therefore have also to be biased relative to each other. This 
difference is equivalent to a relative bias factor of about b ~ 
1.7 — 2.2 with some evidence for a rise towards radii of about 
9a_p ~ 2' and a subsequent decline for even smaller radii. 
The evidence for this scale-dependence is, however, weak. A 
rise is expected, though, because of the different power-law 
slopes of (A/"^) for the two samples. 

For the lowest redshift bin, this is in good agreement 
with iMadgwick et all (|2003l ). their Fig. 4. Furthermore, the 
observed scale-dependence explains why we find an overall 
larger value for the red/blue bias than other authors who 
determined the relative bi as with variou s diffe rent meth- 
ods on larger scales, e.g. IConwav et all (|2005h . b ^ 1.3 
at ISfe-^Mpc [Wild et al.l (|2005l ). b » 1.8 at lOfe-^Mpc 
lyyillrner etal] l|l998l ). b 1.2 at 8/i"^Mpc, IG uzzo et all 
1 19971), fe« 1.7. According to our halo model (Fig.[T2]|, which 



will be discussed in the following sections, we should expect 
a steep decline of the bias factor down to 6 ~ 1.5 beyond our 
largest aperture radius which would reconcile our 6 ^ 2 — 3 
with other studies. 

Within the errors we do not see an evolution of the 



rel ative bias with redshif t as, for example, has been reported 
by Le Fevre et al. (Il99d) . This would support the finding of 
iPhleps fc Meisenheimel (l2003h . 

Recently. ICoil et al.l (fiooa ) have reported for DEEP2 a 
relative bias of 6 ~ 1.28 averaged over spatial scales between 
1-15 /i-^Mpc and b ~ 1.44 for 0.1-15 ft-^Mpc which im- 
plies an increase of the bias for scales smaller than 1 h~^Mpc. 
This also agrees with our finding. 

For the bias parameter r{9ap) we can make out a 
trend of decorrelation, r 7^ 1, between the two samples 
towards small scales starting from about 10', which corre- 
sponds to (proper) ~ 3 — 5 h~^Mpc depending on the mean 
redshift. We estimate that the correlation factor drops to 
r ~ 0.60 ± 0.15 on the smallest measured scale. An evolution 
with redshift exceeding the statistical errors is not visible. 
Therefore, as other authors j_we find a correlation close 



to uirity on large scales (e.g.lWild et al . 2005; Co nway et al.l 
I2OO5I : lBlantonll2000l : iTegmark fc Bromley,, 1999.) that is de- 
creasing towards smaller scales. We expect our data points 
to become eventually consistent with r ~ 1 not far beyond 
^ap ^ 20'. Note, again, that we are probing smaller scales 
tha n the cited authors due to different methods. 

IWang et al.l (|2007l ) studied the cross-correlation statis- 
tics of galaxy samples with different luminosities and colours 
in SDSS. Their results for the cross-correlation between the 
faint red and faint blue sample is consistent with our results. 
In particular, they also find a decrease of r(6la p) towards 
smaller scales (see their Fig. 16, bottom panels) n lCoil et al.l 
(2008) pointed out that below a scale of 1 h'^Mpc the cross- 
correlation function of their blue and red sample drops below 
the geometric mean of the separate auto-correlation func- 



Note that Wang et al. are essentially plotting r 
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tions. This is to say that their correlation factor becomes 
less t han unity on th e se sca les. In this context, see also Fig. 
11 of[s wanson et al where a decorrelation towards 

smaller scales is found for splitting the data set into red and 
blue galaxies. 



5 GALAXY-GALAXY LENSING 
5.1 Method 

To impose further constraints on the following halo-model 
analysis, we additionally measure the mean tangential shear 
of source galaxies (GaBoDS), {7t), as function of separation 

6 about the red and blue lens-galaxies (COMBO-17) for dif- 
ferent redshifts, i.e. 



(7t(e)) 



e(6li)e- 



(31) 



where the average tangential ellipticity over all source- 
lens paiTS_with_se2axati£i^^ = \9i — 02 1 has to be taken 
(cf. iKleinheinrich et al.ll200^ : Zdi,62 is the angle that is 
spanned by 62 — di, the difference vector between source 
and lens position, and the a;-axis. 

The mean tangential shear about lenses can be re- 
lated to the differential projected matter over- density about 
lenses, in excess to the cosmic mean (e.g. iMcKav et al.l 

iml). 



AE(6i) = E(< 6*) - E(6i) = 



(it) (6) 



with 



AtvG I f kiwi),h{ws - Wl) 

./k(Ws) 



(32) 



(33) 



if we specify a fiducial cosmology and the distribution of 
sources (lenses) in distance from the observer, (wi); {. . .) 
denotes the average over the lens and source distribution. 
The function /k(w) is the comoving angular diameter dis- 
tance as function of the comoving radial distance w and the 
curvature, k, of the fiducial cosmological model. By E(< 9) 
we denote the average line-of-sight over-density within a ra- 
dius of 9, the lens is at the disk centre, whereas E(S) is 
the average over-density over an annulus with radius 9. We 
rescale our measurements of (71) with Ecrit in order to get 
rid of the infiuence of the lensing efficiency. This gives us 
comparable quantities for lenses of all redshift bins. 

For the redshift distributi on of the sources we use the 
fit of lHetterscheidt et al.l (|2007l ) to the empirical distribution 
of photometric redshifts as seen in the Deep-Public Survey 
(DPS) (Hildcbrandt ct al. 2006), see Fig. [9] Our shear cat- 
alogue is a sub-sample of the shear catalogue used in that 
study. 

A representative example for the correlation of statis- 
tical errors in the GGL estimate, and the cross-correlation 
between A/'-statistics errors and GGL errors, is given by Fig. 

m 



5.2 Results 

Our measurements for of AE(&) are shown in Fig. |8l The 
data is binned between 6" ^ 9 < 3' into five logarithmic bins 



N 1 




0.5 



1 S 
z 



Figure 9. Distribution of sources belonging to the cosmic shear 
catalogue, as estimated by the photometric redshifts of galaxies 
inside the magnitude interval R e [21.5, 24.5] in the DPS. The 
black histogram shows the empirical distribution, the red his- 
togram is the fit to the data. The empirical data points around 
2 = 1.1 (dip) arc down- weighted in the fit due to systematics in 
the photometric redshifts. The error bars denote the field-to-field 
variance. The mean redshift of the sourc es is z = 0.78. The figure 
is taken from iHetterscheidt et al ] l2003). 



in angular separation. The remaining statistical uncertain- 
ties are high so that almost all measurements are at la 
consistent with zero. However, when combining all redshifts 
bins, we find a slight signal for both the red and the blue 
galaxy sample. The combined red signal is higher than the 
blue signal, roughly by a factor of two or three. This is con - 
sistent with what was found bv Kleinheinrich et al. ||2006|). 



using the same data, [Sh cldon ct al. (2004), McKav et al.l 
()200lh and lGuzik fc SeliakI (120011 ). It imphes that the envi- 
ronment of red galaxies (or the galaxy itself) contains more 
mass than the environment of a typical blue galaxy, or that 
the typical size of a blue lens halo is smaller than that of a 
red lens, although it might have the same mass than the red 
lens halo. 

When fitting HOD parameters to the data, we use the 
GGL signal of the different redshift bins to constrain the 
allowed regime of lensing predictions by our model. Due to 
the large statistical uncertainties of the measurement, the 
GGL mainly serves as an upper limit. 



6 INTERPRETATION WITHIN A HALO 
MODEL FRAMEWORK 

In this section, we translate the clustering statistics and 
GGL-signal of the galaxy samples into halo-model param- 
eters. The halo-model description is utilised to describe 
the 3D-distribution of galaxies and dark matter. We only 
briefiy summarise the halo-model formalism here and refer 
the reader to the literature for the details. How the 3D- 
distributions, expressed by the halo-model, relates to the 
observed projected angular distribution on the sky is dis- 
cussed later on in Sect. 6.3. The reader may find the details 
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Figure 8. DifTerential surface mass density, AS, as function of angular separation about lenses from the red (left panel) and blue (right 
panel) galaxy samples as inferred from GGL. The shaded area denotes the Icr-confidence region about the mean of all redshifts taken 
together. The data points are slightly shifted to increase the visibility. From the lowest to the highest redshift, a separation of 0'.5 
corresponds to a projected comoving distance of 125/i~^kpc, 195/i~^kpc, 265?i~^kpc or 320/i~^kpc, respectively. 



on the dark matter halo properties used for this paper in 
Appendix [X] Note that the notation used therein is intro- 
duced in this section. The Fourier transform of a function 
/(r), denoted by a tilde /(fc), is defined in analogy to Eq. 
((5| but now for three dimensions. 



6.1 Halo-model description 

6.1.1 General halo-model formalism 

The halo model is an analytical prescription for the clus- 
tering of dark matter that was motivated by N-body sim- 
ulations of the cosmi c structure formation (for a review: 
ICoorav fc Shethll2002f ). All matter is enclosed inside typi- 
cal haloes with a given mass spectrum. Galaxy mock cat- 
alogues can be generated within this framework by popu- 
lating virialised dark matter haloes with galaxies accord- 
ing to a prescription taken directly from semi-analytic mod- 
els of galaxy formation or hydrodynamic simulation that 
include recipes for the formation (evolution ) of galaxies 
iBerlind fc WeinberdlioO^ : iBenson et allbOOOl ) . The param- 
eters for populating the haloes with galaxies depend solely 
on the halo mass. The halo-model description has been quite 
successful in describing or fitting observational data, al- 
though doubts about the strict validit y of the basic mode l 
ass umptions have been cast recently by Gao fc White! (|2007l ') 
and ISheth fc TormenI l|2004l ) (assembly bias) . 

One key assumption of the halo model is that the dark 
matter density or galaxy number density are linear super- 
positions of in total A'^h haloes with a typical radial profil(E|: 



ni{r) = ^ Ni{m.j)ui{r - 



(35) 



The halo profiles, Ui, describe the spatial distribution of 
galaxies, i > 0, or dark matter, i — 0, within haloes. All pro- 
files considered here are normalised to unity. 

The profiles depend on an additional parameter, m, 
which is the total mass of the dark matter halo, r is a posi- 
tion within the comoving frame and is the centre of the 
i-th halo. By Ni{mj) we denote either the number of galax- 
ies, type i, populating the j-th halo (if i > 0), or if i = 
the dark matter mass, ruj , that is attached to the j-th halo. 
In the latter case, no{r) is simply the dark matter density 
as function of position. In particular. No and Ni with i > 
have, for convenience within the formalism, different unit^ 
(halo mass versus number of galaxies). 

In general, the halo-occupation number Ni(mj) and 
the halo centre, Tj, axe random numbers. The conditional 
probability distribution of the halo-occupation number, 
P{Ni\mj), is a function of the halo mass only. In the case 
of dark matter, i = 0, one has simply Ao(mj) = nij, i.e. 
P(Ao|mj) = 5d(Ao — ruj), thus the mass associated with a 
"dark matter halo rut" is always rut, whereas the number 



like galaxies, that are only on average distributed according to 
the halo profile, the more accurate expansion would be: 



ni{r) 



(34) 



^ Actually, this expansion assumes that galaxies or dark matter 
are smoothly spread out over the halo. This is a good approxi- 
mation if the number of particles is large. For discrete particles, 



where Ar^; is the position of the l-th galaxy relative to the fc-th 
halo centre. The discreteness makes a difference for the one-halo 
term, which is taken into account for the following results. 
^ The proper normalisation for the power spectra is done by A'^^. 
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of galaxies living inside haloes of same mass may vary from 
halo to halo. 

Both Niirrij) and Ui{r,mj) depend solely on nij and 
not on the mass or position of any other halo. Moreover, the 
position of a halo, r^, and its mass, ruj, are postulated to 
be statistically independent. However, the halo-occupation 
numbers of different galaxies i and j inside the same halo 
may be statistically dependent on each other. For example, 
inside the same halo the number of red galaxies may be 
related to the number of blue galaxies and vice versa. This 
is the idea behind the concept of the joint halo-occupation 
distribution. 

Based on these assumptions, the halo model predicts 
the 3D-power spectra of galaxy number density correla- 
tions, galaxy-mass density correlations and mass-mass cor- 
relations, Pij{k), as function of a halo mass spectrum, halo 
bias parameters, typical density profiles and a HOD of galax- 



^ '- = {2-KfP,j{\k\)5u{k + k') 



niUj 



(36) 



Here and in the following, {. . .) is the statistical average 
over all possible haloes, and means the average number of 
galaxies per unit volume. This paper only considers isotropic 
halo density profiles Ui{r, m) independent of the direction of 
r, so that: 



Ui{k, m) 



/g°°drrfc ^ Ui{r,m) sin (kr) 
dr Ui (r, m) 



(37) 



This equation includes a normalisat ion of the density profile. 

F ollowing the calculations of IScherrer fc Bertschingen 
l)l99ll ^ one finds for the power spectra (fe > and A'h S> 1): 



P,j{k)=P^J'ik)+P^J'ik) , 

where the so-called one-halo term is 

roc 

Pl^{k) = Amn{m)K]f{k,m) 
pNiNj Jo 

the two-halo term 



(38) 



(39) 



Pi/ik) = / dmin(mi) / dm2n(m2)x (40) 

NiNj Jo Jo 



P{k, mi, m2)Kij {k, mi, 7712) 



and 



/ \ — f°° 

dmn{m) {Ni{m)) ; ni — I dmn{m)in . (41) 



By p = Pcritf^m we denote the mean comoving matter 
density of the dark matter that is included inside the dark 
matter haloes. The constant is the matter density pa- 
rameter and pcrit the critical density. 

The different cases of power spectra (galaxy auto-power 
spectra, galaxy cross-power spectra, dark matter/galaxy 
cross-power spectra) give different results for the above in- 
tegrals. To save space, all variants are encapsulated into the 
integral kernel^ 



Kl^{k,m) 



Kif{k,m) 



£ fi,'''(fc,m) J2 n^[\k,m))- (43) 

q— 1 r— 1 
q = l 

Niim) Nj(m) 

9=1 7—1 



where (...) denotes the average over all haloes of mass m 
and iif\k,m) expresses the spatial probability distribution 
of the q-th, out of g £ [1, Ni{m)], galaxies belonging to the 
sample i and a particular halo of mass m. 

We need this further distinction into various spatial 
distributions since we may split a galaxy sample into one 
central galaxy - sitting at the halo centre hence having 
u{k,m) = 1 - and {Ni{m) — 1) satellite galaxies with a dif- 
ferent distribution. The term with the Kronecker pre-factor, 
5^, in Eq. (|43|l is only applied if both samples i and j are 
discrete (galaxies). It accounts for the subtraction of white 
shot-noise contribution, l/fii, to the clustering power that 
is not measured due to the definition of the clustering cor- 
relation function, uj{9) (excess of pairs over a uniform dis- 
tribution), and the aperture statistics. For the smooth dark 
matter, i — 0, one has to substitute in the Eqs. (|43[) and (|44|l 



^^(m) ii'^i^ (^f^^r/ji^ = rnuo{k,m), which simplifies the equa- 



tions 



As an example, if all galaxies of a same sample i have 
identical spatial distributions, Ui{k,m), one will find for i 7^ 
j and i,j > 0: 



JVi(m) 

u'f\k,m) = Ni{m)ui{k,m) , 

9=1 



(45) 



Ni(m) 



9 = 1 



The mean number density of galaxies is 

m = (m{r)) = ^Ni . (42) 
\ / m 

The function n(m)dm is usually the mean number den- 
sity of haloes within the mass interval [m,m + dm]. Note, 
however, that in the above equations, and for the following 
fii, n{m) may be rescaled by any arbitrary constant with- 
out changing the results. The average (^Ni{m)^ is the mean 

number of galaxies found within a halo of mass m (for i > 0), 
or the mass of the halo itself, if i = 0. Notice that after the 
statistical average performed for the power spectra we have 
shifted the notation from Ni{mk) (number of i-galaxies in- 
side halo k) to Ni(m) (number of i-galaxies inside a halo of 
mass m). Eq. ll34t . 



Klf{k,m) = Ui(k,m)uj(k,m)(^Ni{rri)Nj(rn)^ , 
\k,m) = Ui{k,m)uj{k,m)(^Ni{rn)^(^Nj{rn)^ 



A galaxy sample with Ni°'^{m) £ {0, 1} central galaxies and 
Nf^^{m) satellites of same in-halo distribution Ui{k,m) has 



(46) 



From Eq. (|45p and (|46|l all kernels relevant for this paper 
follow. They are listed in Table [3] 

The function P{k, mi , m2) means the cross-power spec- 
trum of the number densities of haloes with masses mi and 

^ The spatial distributions of galaxies inside the same halo are 
postulated to be statistically independent, the kernels follow from 
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Table 3. Integration kernels for the one- and two-halo term, Eqs. 1)39 | l and H40| l. In the simple model, all galaxies of same type are 
distributed over the halo the same way. For the central-galaxy scenario, the halo occupation, Ni(m) = Af™°(m) + Af?''*(m), of every 
galaxy type, i, is split into a central galaxy and satellite galaxies. Here, it is explicitly assumed that Af|'^'(m) > requires Af™"(m) = 1, 
i.e. (Afj'^''°(m)Afj°'''(m)) = (Af?''*(m)). The mixed model assumes one type of galaxies, i, to possess a central galaxy and another, j, 
having no central galaxy as in the simple model. For that case, i is described by the central model, j by the simple model; only for the 
cross-correlation of both a new kernel is needed. Note that NQ{m) = m, dark matter is always "simple". 



i 


j 


integral kernel K^^{k, m) 


model type 




i > 
i > 




j = i 


[{jo(fe, m,)]-^m,^ 

[u,{k,m)]^(^N,{m) {Nii-m) - 1) ) 
Ui{k, m)uj (k, m)(^Ni(m)Nj (rri)^ 


simple 


i > 
i > 
i > 




j = i 

j^i>0 


uo{k, m)m(^N^'"^{m)^ + Ui(k, m)uo{k, m)m(^Nf'^*- (m)^ 
2ui{k,m)(^Nl'^^{m)^ + [ui{k,m)]'^ (^Nf'^^ {m) {Nf^^(m) - l) ) 
Ui{k,m)(^N^'^*-{m)N^°'^{m)'j + (fc, m)<^Af|='* (m)Aff<=" (m)^-f 
Ui{k,m)uj{k,m)(^NI''^{m)N^''^{m)'^ + ^JVf™(m)iVj^<="(m)^ 


central 


i > 


j + i 


Uj{k, m)(^Nf'"^{m)Nj (m)^ + Ui{k, rn)uj (fc, m)(^Nf'^^ {m)Nj (m)|) 


mixed 


i 


j 


integral kernel K^^{k,mi,Tn2) 


model type 


i 


j 


Ui{k, rai)uj(k, m2)(^Ni(m\)^ (n j(m2)^ 


simple 


i > 
i > 




j > 


[tij(fc,mi)(|iVf<="(mi)^ -f Mi(fc,mi)uj(fc,mi)<^Arf=''(mi)^] X 
[Si (fc , ma ) (| (ma ) ) -f Si (fc , m2 )Sj (fc , ma ) <^ AfJ^'' (m2 ) 


central 


i > 


j + i 


iij(fe,m2)(A';,(ma)) [(Aff<="(mi)) -f ni(A:,mi)<^Arf=''(mi))] 


mixed 



ma. It is common practice to assume a linear deterministic 
biasing between the halo number densi ty and the linear dark 
matter density (|Coorav fc Shetbll2002l ): 

P{k,mi,m2) X Piin{k)b{mi)b{m2) (47) 

with b(m) being the linear bias factor of haloes with mass 
m and Piin{k) the linear dark matter power spectrum. This 
reduces the 2D-integral in Eq. (|40|l to a simpler product of 
ID-integrals. 

6.1.2 Joint halo-occupation distribution 

Within the framework of the halo model (see Eqs. (|39|l and 
([40])), one works out the clustering statistics of galaxies by 
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specifying the mean number of galaxies for a halo of cer- 
tain mass m, {Ni{m)), and the mean number of galaxy 
pairs, either pairs of the same galaxy type (auto-power), 
{Ni{m){Ni{m) — 1)), or pairs between different galaxy types 
(cross-power), {Ni{m)Nj{m)) . Things become slightly more 
difficult, though, if we distinguish between central galaxies 
and satellite galaxies as can be seen in Table |3] 

In general, the number of galaxies or galaxy pairs for a 
fixed halo mass are first and 2°'*-order moments of a joint 
halo occupation distribution (JHOD) of two galaxy popula- 
tions i and j. The JHOD, P{Ni, Nj\m), determines the prob- 
ability to find a certain number of "galaxies i" and "galaxies 
j" inside the same halo of mass m. The 2"'^-order moment. 



Clustering and joint HOD of red and blue galaxies in COMBO-17 17 



Anti-correlated joint HOD 



Uncorrelated joint HOD 




R=-l 



R=0 




R=+l 



Figure 10. Simplified illustration of the concept of the joint halo occupation distribution for three cases. Left panel: the dark matter 
haloes (shady disks) are filled with different galaxy types that tend to avoid each other. If the number of red, elliptical, galaxies is 
increased for a halo of given size, the number of blue spiral, galaxies decreases. Right panel: over- and under densities of the two galaxy 
populations are highly correlated. If the number of red galaxies for a halo of a given size increases, the number of blue | 
as well. Middle panel: the halo-occupation numbers are not correlated. 



ilaxies increases 



{N,{m)Nj{m)} = ^ P{Ni, Nj\m) mNj 



(48) 



single haloes, whereas r{9^p) is a correlation of galaxy num- 
ber densities as seen in the angular clustering of galaxies. 



of the JHOD can conveniently be parameterised in terms of 
the mean halo-occupation rmmber, 



JVi,JV,-=0 



the (central) variance of the HOD, 



a,(m) = y(iV2(m))-(iV,(m))' 
and the JHOD-correlation factor: 



(49) 



(50) 



'm{m)N,{m)) - (N.{m))(N,{m)) 

Rrj {m) = ^ ; , \ , ^ ^ ^ . (51) 

ai{m)aj{m) 

The JHOD correlation factor expresses the tendency of 
two populations to avoid or attract each other inside the 
same halo. See Fig. [TD] for a simplified illustration. This ten- 
dency, however, can only be seen in the one-halo term of the 
cross-power spectrum, Eq. (|39p . which is observable when 
considering the cross-power (cross-correlation function) of 
two galaxy populations for small separations. 

The effect of the JHOD correlation factor Rij{m) on 
the cross-moments becomes negligible, i.e. 



Ni{m,)Nj{m) 



N,{m)){Nj{m 



(52) 



if the relative fluctuations in galaxy numbers inside haloes 
become small, i.e. if ai{m) / (^Ni{'m)^ -ti 1. Since haloes with 
mean galaxy occupation numbers of more than roughly 
one are expected to have a Poisson variance, one can ex- 
pect that the JHOD correlation factor loses significance for 
massive haloes. For large m the relative variance becomes 
(X l/V(iV.(m)). 

In order to avoid confusion, we would like to stress again 
that Rij expresses the correlation of galaxy numbers inside 



6.2 Adopted model for JHOD of red and blue 
galaxies 

In the following we describe the details of the HOD of the red 
and blue galaxy sample. A galaxy sample (blue or red) can 
either have a central galaxy and satellites or solely consists 
of satellites. Satellites are distributed according to the dark 
matter in a halo, but possibly with a different concentration 
parameters. Central galaxies sit at or close to the centre 
of a halo, for the latter of which our "central models" are 
an approximation. We will distinguish three different model 
flavours: 

(i) A scenario in which we have only red and blue satel- 
lites populating a dark matter halo. 

(ii) A scenario in which red galaxies are both central and 
satellite galaxies, blue galaxies are only satellites. 

(iii) A scenario in which we have red and blue central 
galaxies and red and blue satellites. 

Case ii) is motivated by the observation that galaxy 
clusters often have red galaxies as central galaxies. Case iii) 
is motivated by the possibility that the observed galaxy- 
galaxy lensing signal of the blue galaxies and the power-law 
clustering may also require a central blue galaxy. 

We start off with the description of the HOD of a single 
sample. The interconnection of the red and blue sample is 
discussed later on where we address the problem of red/blue 
galaxy pairs. 

For modelling the HOD of our red and blue galaxy 
sample we use, with some minor modifications, th e 
parametrisations that wer e dis cussed in 'Zheng et al.] l|200'iD . 
IZheng fc Weinberg! (|2007l ) and lZheng et al.. (,20051 )7^ 
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6.2.1 Mean galaxy numbers and distribution inside haloes 

The HOD (red and blue) for haloes with mass m, 

N{m) = iV™"(m) + iV""'(m) , (53) 

is split into one central galaxy, N'^'"^{m) G {0, 1}, and satel- 
lite galaxies, N'^'^*{m). A central galaxy is placed at the cen- 
tre of the halo. If there is a galaxy inhabiting a halo, there is 
always one central galaxy. This is used in the following rela- 
tions. Satellite galaxies are distributed according to a NFW 
profile with concentration parameter c' = fc, where c is the 
concentration of the dark matter (Appendix For / > 1 
galaxies populating the haloes are more concentrated than 
the dark matter, while for / < 1 galaxies are less concen- 
trated than the dark matter. 

The halo mass dependence of the mean HOD is assumed 
to be 



iV(m)^ = ^iV'=™(m)^ -h^iV=="(m)^ 



(54) 



AT" 



1 +erf 



log m — log m„ 



^log m 

- mo' 



(55) 

(A=^'(m)) = ^7V^-(m))(^^i^)'H(m-mo),(56) 



where 



erf (x) 



2_ r 



dte~ 



(57) 



is the error function. 

We would like to keep the number of parameters, re- 
quired to explain the data, as small as possible. For that 
reason, we set mo = mmin because we found that a free 
mniin does not significantly improve our fits. An additional 
parameter / 7^ 1, on the other hand, that gives some free- 
dom in the shape of the density profiles of blue and red 
galaxies yields improved fits and is therefore included. 

Note that due to the previous definition of {N{m)) 
we can still have (central) galaxies for m ^ mmin, as 
1 + erf (s) 7^ for 2; < 0. Other autho rs prefer to define a 
hard cut-off for N{m), as for instance in lPhleps et al.l (|2006l ). 

6.2.2 Number of galaxy pairs of same sample 

In the original model, the fluctuation in the number of 
satellites is assumed to be Poi sson (referred to as " Pois- 
son satellite" model hereafter; iKravtsov et al.l 120041 '). i.e. 
^[^Batj2^ = (Ar=^*)2 + {N'"'*-}. Their assumption completely 
fixes the variance of the total number of galaxies inside a 
halo to (we skip in the following the arguments "(m)" to 
save space): 

a^iN) = (N^) - {Nf 

= {NiN-l)) + {N)il~{N)) (58) 
= 3{Af="') + (AT"™) (1 - 2{A="')) - {N""^f , 

because the variance of the number of central galaxies is 
always 

a' (A""") = {A"'=")[l - {A"°")] , (59) 

owing to the fact that A*^"" is only zero or one (Bernoulli 
distribution). In particular we have ([A'^'^"]^) = (A'^™). 

The HOD variance or number of galaxy pairs of 
the "Poisson sate l lite" model can, using the notation of 
IScoccimarro et ahl l|200lf ). be written as 



{A(A- 1)) = a^{A) = 



{A°"')((A''"') -F2) 



(60) 



where {A(A — 1)) is the mean number of galaxy pairs, re- 
gardless of whether they are central galaxies or satellites. 
The variance (or number of galaxy pairs, see Eq. I\5S^ ) be- 
comes Poisson for haloes m with a — 1, sub-Poisson for 
Q < 1 and super- Poisson for o? > 1. In the "Poisson satel- 
lite" model a is a function increasing slowly from zero near 
m = mmin to unity for large {A(m)). See the dashed line in 
right panel of Fig. [11] 

We found, however, that this mean number of galaxy 
pairs per halo hardly reproduces the deep decrease in the 
bias parameter r{Oap) of our red and blue galaxy sample 
(see Section r4.3|) . presumably because it becomes Poisson at 
too large m. 

For that reason, we relax the assumption of the "Poisson 
satellite" model by introducing another model parameter. A, 
that delays the onset of a Poisson variance in A(m), A < 1, 
or accelerates it, A > 1. We achieve this by keeping the shape 
of a(m) as in the "Poisson satellite" model, Eq. (|60|l . but 
rescaling the (log)mass scale, giving us a new a'{m): 



a(^m X [m/mjninY 



(61) 



Thus, for A = 1 our model uses exactly the same HOD 
variance that is postulated in the origi nal satellite model. 
Comp are this to the parametrisation of IScoccimarro et al.l 
(|200ll ). where a(m) is postulated to increase linearly with 
Inm from mmin onwards. 

The number of satellite pairs, needed for the power 
spectra integrals, is then calculated from a'(m) via 



(62) 



invoking the relation between a(m), now substituted by the 
rescaled a'(m), and the mean number of galaxy pairs. Fol- 
lowing from this, the variance of the number of satellites is 
consequently 



a' (A" 



f (A)' - (A="')({A="') -f- 1) 



(63) 



The variance of the total number of galaxies, required for 
models lacking central galaxies, is simply 



a\N) = [af {A)' + (A)(1-{A)) 



(64) 



The variance parameter a(m) cannot be reduced arbi- 
trarily, though, as a too small variance (too sub-Poisson) 
may be in conflict with the mean number of galaxies. Put 
in other words, the number of pairs, Eqs. (|60|l and (|62p . and 
the variances, Eqs. (|63p and ((64}, have to be larger or equal 
than zero, i.e. 



] ^ max 



2{A°=^') {A°=^')(1-K{A°^')) ^ ]_1 



(65) 



{A)2 ' {A)2 ' (A) J 

If [a']^ is smaller than the r.h.s., we set [a']^ equal to the 
r.h.s. As a' is usually an increasing function with halo mass 
m, this resetting is not necessary for A ^ 1 but may be 
important if we try to delay Poisson statistics compared to 
the "Poisson satellite" model. 

The effect of A on the bias parameter r(Sap) is demon- 
strated in Fig.[TT]for some particular examples. It is remark- 
able to see that an early (low m) Poisson variance in the 
galaxy number suppresses the cross-power towards smaller 
r(Sap) 1, while a late Poisson variance (higher m) yields 
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Figure 11. Effect of the tlOD variance parameter A (for both populations equally) on the cross-c orrelation function of two hypothetical 
populations of galaxies. The parameters for the galaxy populations were chosen arbitrarily from lzhene et al] (|2005| ). As model param- 
eters red galaxies have {1012-69Mq, 0.15, IQI^-S^Mq, 10i3-82]y[Q 08}, blue galaxies have {10"-^*Mq, 0.15, IO^'^'^Mq, IO^Mq, 1.02} 
for {rdmin, ciogmi ™0, 'Ti', e} (see text for details). The galaxy concentration is set / = 1 in both cases, both populations have no cen- 
tral galaxy component. The JHOD correlation factor is set to R = +1. Galaxies are uniformly distributed within the redshift range 
z G [0.2,0.4]. Left panel: Different lines correspond to a bias parameter r(9ap) for different A (numbers attached to solid lines). Top line: 
a variance of the HOD according to the "Poisson satellite" model, A = 1. Bottom line: a variance almost Poisson for all haloes, A = 10. 
Right panel: The corresponding HOD variance parameter, [o'(m)]^, of the blue galaxies for some of the lines in the left panel. 



higher values for r{8^p) > 1 on small scales. Therefore, the 
bias parameter r(Sap) promises to be a probe for the vari- 
ance in the HOD. 

Finally, with this parametrisation each galaxy sam- 
ple has six, {mmin, CTiog m, e, /, A}, different parameters. 
Moreover, we have one more additional parameter which 
expresses the correlation within the JHOD of blue and red 
galaxies. 

Based on this, we consider three different flavours of the 
model - detailed in the following sections - which permit 
red or/and blue galaxies as central galaxies and always have 
red and blue satellites. If a sample has no central galaxy, 
we still use {N{m)} as in Eq. and {N{N - 1)) as in 
Eq. (|60|l (with rescaled a) but distribute all N{m) galaxies 
like "satellites" over the halo; {N{m)) is the HOD of all 
galaxies in that case. This tests the idea - no central galaxy 
for a galaxy population but everything else in the model 
unchanged - if the data really requires a galaxy sample to 
have a central galaxy. 

In Fig. [12] we show a few examples for the linear 
stochastic bias between two galaxy populations based on 
the three scenarios, detailed below. By splitting the con- 
tributions to the cross-power spectrum, stemming from 
the one- and two-halo terms, the figure shows in which 
regime we can expect the two different terms to domi- 
nate. The transition is roughly at an aperture radius of 
20 arcmin for a mean redshift of z; ~ 0.3. This corresponds 
to an effective projected (proper) scale of 5.6/i~^Mpc. 
For comparison, the (comoving) virial radius of a NFW 
halo with lAx 10^" /I'^M© is rvir ~ 0.8/i"^Mpc, or 

rvir « 0.64/i-^Mpc (m/lO"/^"^M0)''■^^ Also shown is the 
clear impact of the JHOD correlation factor. Galaxies with 
tendency of avoidance have a smaller cross-power. 



6.2.3 Number of cross-pairs for red and blue central 
galaxies 

The statistical cross-moments for a central galaxy sce- 
nario (central galaxies for both or just one population) 
requires generally more information on the JHOD than 
just first- and 2°'^-order moments of P{Ni,Nj\m). The 
correlations between satellite numbers and central galaxy 
numbers have to be specified as well. Namely, required 
are also {Ni{m)\Nj{m) — 0), the mean number of i- 
galaxies for haloes which do not contain any j-galaxy, and 
P{Ni{m)Nj(m) = 0), which is the probability to find either 
no j-galaxy or no j-galaxy inside a halo of mass m. We show 
this in Appendix iBl 

To not unnecessarily increase the number of free model 
parameters in a central-satellite galaxy scenario, we can 
make a reasonable approximation, though. For every galaxy 
population having central galaxies, we set N"^"^ = 1 for halo 
masses m where the mean number of galaxies is {N{m)) ^ 1, 
and A'^'^''' = otherwise. This means that if the mean num- 
ber of galaxies belonging to a population is less or equal 
one, it is essentially only the central galaxy we can find in 
those haloes. On the other hand, if the mean galaxy number 
is larger than one, then there is always at least a central 
galaxy. 

Applying this approximation allows us to express the 
cross-moments of the HOD entirely in terms of known vari- 
ances, means and the JHOD correlation factor Rij{m). In a 
scenario with blue and red central galaxies, we hence distin- 
guish four cases: 

(i) Case: (N,) < I, (Nj) < 1 
Here, we have 

(ivr'A^"") = (ivf 'ivr") = (ivr'A^D = o . (66) 

(ii) Case: (A^,) > 1, (A^,) < 1 
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aperture radius [arcmin] aperture radius [arcmin] aperture radius [arcmin] aperture radius [arcmin] 



Figure 12. Example predictions for the relative linear stochastic bias of red and blue galaxies for three different scenarios (with/without 
central galaxy component, only red galaxies have a central component). The model parameters are chosen as in Fig. [TT] except that we fix 
A = 3.0 for both blue and red galaxies and we vary the JHOD parameter R. Left figure: The bias parameter r(9ap) is plotted (solid lines) 
for two different JHOD-correlation factors (halo-mass invariant), R = ±1. To separate contributions to the cross-power originating from 
the one-halo (Ih; dashed line) and two-halo (2h; dotted line) term, the cross-power was recalculated for R = +1 considering only one 
contribution at a time. The solid thick line in the right panel assumes a model where the blue and red samples have a central component, 
whereas the thick dashed-dotted line is for a model where only the red sample has a central component (no one-halo/two-halo term 
splitting). Right figure: Linear bias factor, 6((9ap) (solid lines; red galaxies are more strongly clustered). The JHOD-correlation factor 
has no impact here. To estimate the importance of the one- and two- halo terms, fe(^?ap) was also recalculated with the red galaxy power 
spectrum consisting of only either the one- or two-halo term. The thick dashed-dotted line in the right panel corresponds to the model 
where only the red sample has a central component (no one- halo/two-halo term splitting). 



We will use: 

{^iVr'A^f*) = ^Afr^iVf) = , (67) 

(iii) Case; (iV,) ^ 1, {Nj) > 1 
Finally for richly populated haloes, we will have; 

^A^r'A^r) = (^r) ; (A^r^A^f") = (at') , (68) 
^^sat^sat^ = ^Arr*^^7Vf*^ -|-i?,,a(iVr')cr(Aff*) . 

For the sake of simplicity, this model unrealistically also 
assumes that red and blue galaxies have central galaxies 
simultaneously, which would mean for haloes hosting red and 
blue galaxies that we would find a red and blue galaxy at, or 
very close, to the centre. A more realistic model would add 
another rule that decides when (and with what probability) 
we have a red or a blue central galaxy, but never both at 
the same time. 

However, our approximation is fair as long as we rarely 
find haloes with one red and one blue galaxies. As seen later 
on, our galaxy samples start to populate haloes at different 
mass-thresholds, blue galaxies from ~ 10^^Mq/i~^ and red 
galaxies from ~ lO^^Mo/i"^ This means a) haloes that have 
one red galaxy have typically more than one blue galaxy and 
b) haloes with one blue galaxy have typically no red galaxy. 
In the first case a) , the auto-power spectrum of blue galaxies 
is dominated by the satellite terms so that placing one blue 
galajcy at the centre does not make much of a difference. The 
same is true for the red/blue cross-power. In the second case 
b), we have no red galaxy that could be placed at the centre 
together with a blue galaxy so that the problem does not 



arise in the first place. To further improve the approxima- 
tion, we also set for the cross-power kernel K^'^{k,jn), Table 
[3l the cross-correlation {Ni'"'Nj'"^) = which has to vanish 
if there is always only one central galaxy inside a halo. 

6.2.4 Number of cross-pairs for no central galaxies 

We also explore the possibility that neither the red nor the 
blue sample of galaxies have central galaxies. We do this by 
distributing the central galaxies of the previously described 
model with the same density profile as the satellite galax- 
ies. Therefore, we have a mean galaxy number according to 
Eq. (|54p with a variance as in Eq. (|64|l . The mean num- 
ber of galaxy pairs is expressed by [a']'^{N)'^. Moreover, for 
this scenario the factorial cross- moments are straightfor- 
ward, the approximation described in the forgoing section is 
not necessary. The cross-correlation moment of the JHOD 
is thus; 

(iV.iVj) = {N^){N,) + R,,a{N,)a{N,) . (69) 

6.2.5 Number of cross-pairs for mixed model with only red 
central galaxies 

As last scenario we consider the possibility that only the red 
sample has central galaxies, while blue galaxies are always 
satellites. Within this model, blue galaxies are described ac- 
cording to the "no central galaxy" scenario and red galaxies 
according to the previous "central galaxy" scenario. 

Again, for cross-moments in principle we also needed 
to specify {Nj{rn)\Ni{rn) = Q), j denotes blue galaxies and 
i the red galaxies (Appendix [Bj . One finds by using the 
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aforementioned approximation; 

(NrN,) = (^N^){N,)+R^Ja{N^MN,) , 
(^Nr'Nj^ = (70) 
for (A^i) < 1, and 

(iVriV,) = (iV,), (71) 
otherwise. 

6.3 Fitting the halo model to the data 

6.3.1 Method 

We now use all the results on the clustering of the red and 
blue galaxy sample and their correlation to the dark matter 
density, gathered on the foregoing pages, to constrain the 
parameters of our halo model, see Sect. 16.21 The number 
densities of blue and red galaxies for each redshift bin is 
added as additional information (Table [1] bottom) to be 
fitted by the model as well (Eq. |42]| . 

How can we relate the halo model 3D-power spectra to 
the observables, which are projections onto the sky? The 
mean tangential shear about the lenses is, for the i-th red- 
shift bin, related to the angular cross-power spectrum be- 
tween the lens number density and lensing convergence, 
■PrW> by fe.g. ISimon erZll2007h 

/ \ f°° d££ 

(7t,.(^)> = j^^ ^prwMm ■ (72) 

A similar relation connects the A/'-statistics to the angular 
power spectrum of galaxy clustering, Pij{£), see Eq. ((2]|. 

To translate the 3D-model power spectra, Eq. (I38|) . 
to the projected angular power spectra for a given red- 
shift distribution of lens galaxies, Pi{w), and source galax- 
ies, Ps(w), we use Limber's equation in Fourier space 
l|Bartelmann fc Schneiderll200ll ): 




We denote the Hubble constant, the vacuum speed of light, 
the cosmological scale factor by Ho, c and a(w), respec- 
tively. The second argument w in the 3D-power spectrum, 
Pij{k, w), is used to express a possible time-evolution of the 
power as function of comoving radial distance. 

We expect the evolution of the 3D-power spectra within 
the lens galaxy redshift bins to be moderate so that we ne- 
glect the time-evolution over the range of a redshift bin. 
The 3D-power spectra are computed for a radial distance 
w = {wi + W2)/2, where wi and W2 are the distance lim- 
its of the redshift bin. This particular w is justified by the 
redshift probability distributions inside the z-bins. Fig. (2] 
which are relatively symmetric about the mean. 



Table 4. Quality of the halo model fits and a Bayesian com- 
parison of the three different models describing the same data. 
The x^/dof per degree of freedom (dof = 26 — 13 = 13) is for 
the maximum-likelihood fit to the data. By AlniS we mean the 
difference of the Bayesian (log)evidence for a particular scenario 
relative to the scenario with the highest evidence at the same 
redshift. 



z 


x'/dof 


Aln£; 


model type 


0.3 


0.51 


0.00 


no central galaxies 


0.3 


0.56 


0.44 


red central galaxies 


0.3 


0.45 


0.57 


blue and red centrals 


0.5 


1.25 


0.00 


no central galaxies 


0.5 


1.20 


0.86 


red central galaxies 


0.5 


1.24 


0.88 


blue and red centrals 


0.7 


0.57 


0.00 


no central galaxies 


0.7 


0.50 


1.02 


red central galaxies 


0.7 


0.60 


1.19 


blue and red centrals 


0.9 


0.59 


0.00 


red central galaxies 


0.9 


0.55 


0.07 


blue and red centrals 


0.9 


0.62 


0.31 


no central galaxies 



Again, we employ the MCMC-method to trace out the 
posterior likelihood function of our halo-model parameters. 
The A/'-statistics and the GGL signal are put together to 
constrain the model. In general, the JHOD correlation fac- 
tor R{m) (we have just one: between the red and blue sam- 
ple) is a function of the halo mass. As the effect of R(m) 
becomes negligible for small relative fluctuations in the halo 
occupation number of galaxies (large m), we assume the 
same correlation parameter for all m, which is, consequently, 
mainly constrained by haloes with a small number of galax- 
ies (m ~ TTiniin). For the scope of this paper, where we have 
relatively small galaxy samples and relatively large uncer- 
tainties in the clustering statistics (compared to SDSS, for 
example) this is an acceptable approximation. For future 
studies investigating a mass-dependence of the JHOD cor- 
relation parameter may be interesting and feasible. 

We confine the parameter space of the model (flat pri- 
ors) to lO'^M© (m^in,m') ^ lO^^Mo, lO'^ «: aiog™ sC 1, 
s; e s; 2, ji?! ^ 1, 10"^ ^ / 3 and 0.9 ^ a «; 10. Three 
model scenarios are fitted separately: i) both the blue and 
red populations have central galaxies, ii) no central galaxies, 
and iii) only the red sample has central galaxies. 

In order to possibly discriminate between the three sce- 
narios we estimate the Bayesian evidence for each scenario in 
every redshift bin (Appendix [Cj. This method is a more so- 
phisticated approach for model discrimination than a "sim- 
ple" comparison of a best-flt because, rather than looking 
merely at the height of the (posterior) likelihood function, 
the width of the likelihood function is taken into account as 
well. 

6.3.2 Results 

Fig. [TS] shows the model fits to the data belonging to the 
redshift bin z — 0.7. This redshift bin was chosen since it 
has the best signal-to-noise in our data. 

Note that the points (model prediction) in this figure 
are the model averages along the MCMC tracks and not the 
best-fitting models (minimum x^)- The Icr-standard devia- 
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golaxy separation [arcmin] galaxy separation [oromin] goloxy seporotion [orcmin] 

Figure 13. Average fit, compared to the observation, of tlie A/'-statistics {first row, points: variance of red galaxies, filled stars: variance 
of blue galaxies, open stars: cross-correlation) and GGL {second row; filled stars: blue galaxies, points: red galaxies) as predicted by 
the three different halo-model scenarios (first column: no central galaxies, second column: red central galaxies, third column: blue and 
red central galaxies) for the redshift bin z = 0.7. The model fits (average over the MCMC tracks) and their Icr-variance are denoted 
by points and errorbars, the shaded areas bracket the Icr-range in COMBO-17. For the GGL panels shaded areas are the combined 
constraints from all redshift bins. The corresponding model predictions for the number density of red galaxies are (in units of h^Mpc""^) 
11 ± 8 X 10^^ (no central), 8 ± 4 X 10~^ (red central), 9 ± 6 X 10~^ (red and blue centrals) and for blue galaxies 51 ± 15 X 10"'^ (no 
central), 57 ± 18 X lO""^ (red central), 65 ± 21 X 10~^ (red and blue centrals). 



tion of the fits is denoted by error bars. The MCMC-average 
is useful to study how well a model fits the data in general 
and where the main problems of the model in describing the 
data are located. 

We gather from this figure that, in general, qualita- 
tively all three model scenarios appear to provide a pretty 
good description of the data. The main differences appear 
for Sap < 1' (A/'-statistics) and 9 < 0!5 (GGL); the differ- 
ences in the A/'-statistics are small, though. The scenario 
with no central galaxies at all is bound to systematically 
smaller values for the A/'-statistics and, thus, has small diffi- 
culties in explaining the clustering statistics of red galaxies 
for smaller scales. Since statistical uncertainties grow large 
for small scales, those difficulties are not too significant for 
our data. 

Three other issues can be identified: 

• Although the galaxy clustering and the GGL is well 
described by the models, there is only moderate agreement 
between the predicted and observed mean number densi- 
ties of blue galaxies: the predicted numbers are higher. The 
measured value (Table [1} is, however, still within the 2a- 
scatter of fibiue of the Markov chains. For example, for 



z — 0.7 the model predicts consistently for all scenarios 
Wbiuc ~ (57 ± 18) X 10"^ /i^Mpc"^, whereas the data esti- 
mate is nbiuo = (18 ± 3) X 10~^ h^Mpc~^. This could point 
towards an inaccuracy of the halo model or/and to a system- 
atic underestimate of nbiuc provided by the Knax-estimator. 
The observed numbers of red galaxies are always within 
the Icr-scatter of the Markov chains although the means 
of the chains are always larger than the observed values, 
which again could be indicative of an overprediction by our 
halo-model. Tensions between the halo model predictions for 
galaxy clustering and number densi t ies we re also noticed by 
other studies, such as iQuadrT et al.l (|2008D . 

• The GGL-signal of the blue galaxies always confiicts 
the data beyond > 0'.5 (too high) because the observed 
AE quickly drops to essentially negative in that regime (if 
the GGL of all redshift bins is combined). Considering that 
most of the other model predictions fit quite well and that, 
to the knowledge of the authors, no negative GGL-signal for 
these galaxy separations has been found in the literature, 
this confiict could very well be a hint towards systematics 
in the lensing data undiscovered so far. 

• The model without red central galaxies is in mild con- 
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flict with the GGL measurement for the smallest separa- 
tions; a galaxy centred on the halo centre boosts the GGL 
signal, which is arguably favoured by the data (when the 
GGL-signal of all redshift bins is combined). 

Table |4] lists the (reduced) of the maximum likeli- 
hood fits and the estimated Bayesian evidence of the three 
different model scenarios. All scenarios at all redshifts give 
good fits to the data. The worst fits, x^/dof ~ 1.25, are for 
the redshift bin z = 0.5 which may be related to the sudden 
drop of the A/'-statistics for the largest aperture radius (up- 
per right panel in Fig. [6]). Since this still has a probability 
of ~ 20%, we can consider it as a statistical fluke. 

Except for z = 0.7, where we find weak evidence for a 
model without central galaxies, the Bayesian model discrim- 
ination does not prefer any particular model. Combining the 
evidence of all redshift bins (the Bayesian evidence just sums 
up), assuming independent statistical information, however 
yields substantial evidence ( A In i? ~ 2) for a model with- 
out central galaxies. On the other hand, by looking at Fig. 
[13] we concluded that a model without central galaxies has 
some problems describing the data on the smallest scales. 
Moreover, our central galaxy models have always, except 
for z = 0.3, a better ■ AH of which taken together makes 
us suspicious, if the Laplacian approximation that is used 
to compute the Bayesian evidence is really accurate enough 
for our work. Our model comparison therefore seems to be 
inconclusive and an improved statistical analysis or a larger 
data set will have to revisit this question in the future. 

Table [5] (after the bibliography) lists the constraints on 
the halo-model parameters for all redshifts and all scenarios. 
For an parameter average over all redshifts (bottom block 
of table), we combine the ID-probability density functions 
(PDFs), Pi{pj), as estimated from the MCMCs, of aU pa- 
rameters, pj, for all redshifts i to obtain a total ID-posterior 
ofpj-. 

P{P,)^llP^iPj) ■ (75) 

i 

From this total posterior we derive the mean and r.m.s.- 
variance of every parameter. Note that for top-hat and equal 
Pi's one has P — Pi, meaning combining information does 
not improve anything in this case. 

First of all, we find that the fits seem to be robust with 
respect to the three different model scenarios, with R, f and 
maybe m' being the only possible exceptions. 

For all aforementioned parameters, we do not see a clear 
trend with redshift, maybe with mmin being an exception 
which (the median) is increasing slightly in z = 0.9 but 
still is consistent with the rest. The redshift-combined re- 
sult of this parameter is rrimin = lO^^'^*°'^/i~^M0 for red 
and mmin = W^^'^^'^'^ h~^MQ for blue galaxies. Therefore, 
blue-cloud galaxies clearly populate smaller haloes than red- 
sequence galaxies. In fact, this is the halo-model explanation 
for the different clustering strengths of red and blue galaxies. 

The combined result for the parameter pair m' and e, 
describing the galaxy occupancy as function of halo mass 
beyond mmin, is roughly for both red and blue galaxies 
m' = lO" °=^° '*/i"^M0 and e = 1.1 ± 0.2, 1.3 ± 0.2 for red 
and blue galaxies, respectively. 

The least constrained parameter in our analysis is 
clearly aiogm which has for all redshifts 0.5 ± 0.2. This is 



essentially what one would expect from a top-hat PDF, non- 
vanishing between ... 1. Therefore, our data does not add 
information that significantly improves our prior. Combining 
all redshifts shrinks the la-confidences somewhat, though, 
because the individual ID-PDFs are not completely flat. 

Apparently only little more information is added to A, 
the HOD variance parameter, by the data. With a top-hat 
prior between 0.9 ... 10 we would expect as constraint A = 
5.5 ± 2.5, which is roughly what we find for the individual 
redshift bins, excluding A of blue galaxies for the model with 
red central galaxies only. However, the ID-PDFs are not 
completely flat preferring some regions in parameter space, 
too. In particular, they exclude values near A ~ 1 which 
is the HOD variance in the "Poisson satellite" model. As 
discussed earlier, this is because A = 1 cannot explain the 
observed deep drop in r(0ap) for small scales. Quantitatively, 
we infer from the redshift-combined PDF of A that we can 
decisively exclude values less than A = 2 for red galaxies 
and A = 3 for blu e galaxies with 95% confi dence. This fits 
to the findings of ICollister fc Lahavl l|2005h that found no 
indications of a sub-Poissonian variance, a{m) < 1, in their 
analysis of red and blue galaxies. 

For the combined concentration parameter of red and 
blue galaxies we find values that depend slightly on the 
adopted scenario. In a model with no central galaxies, red 
galaxies require a concentration larger than that of dark 
matter, /red ~ 1-9 ± 0.5, which, however drops to /red ~ 
1.3 ± 0.5 if a central red galaxy is allowed. Our conclusion 
is that the data demands a centrally concentrated spatial 
distribution of red galaxies either by a larger /rod or by a 
central galaxy. Conversely, for blue galaxies we find for no 
central galaxies, /blue = 1-0 ± 0.4, consistent with the dark 
matter distribution, but distributions flatter than dark mat- 
ter, /blue = 0.6 ± 0.3, if central galaxies are present. 

The JHOD correlation factor, R, turns out to be hard 
to measure. For the individual redshift bins we find little 
improvement compared to the flat prior. For all z-bins com- 
bined, we find R = -1-0.1 ± 0.2 (no central galaxies/only red 
central galaxies) and R — +0.5 ± 0.2 (blue and red cen- 
tral galaxies). This implies that the number of red and blue 
galaxies are uncorrelated in the first case and slightly posi- 
tively correlated in the latter case. 

Inside Fig. [7] we plotted the bestfit solutions to the 
relative linear stochastic bias of the red and blue popula- 
tion found along the MCMC tracks for a mean redshift of 
z = 0.7. The halo model reproduces the scale-dependence of 
bias factor and correlation factor at that redshift well (note 
that the shaded confidence regions are combinations of all 
redshift bins). All three scenarios reveal very similar trends 
with most differences for very small scales which, however 
due to measurement noise (galaxy shot noise), are not well 
constrained. If we use the halo model and bestfit parameters 
to extrapolate the bias parameters to large scales we find a 
bias factor between b ~ 1.46 — 1.58 depending on the partic- 
ular scenario. Those values reconcile our measurements at 
relatively small scales with results from various other stud- 
ies that measured galaxy biasing on larger scales (see Sect. 
I4.3|l . Furthermore, it underlines the need for both small and 
large scale measurements of galaxy clustering in order to 
discriminate different halo-model scenarios. In that context, 
we also would like to point to Fig. 1131 lower row. Here we 
can easily see that the three halo-model scenarios predict 
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clearly different GGL for galaxy separations smaller than 
~ 12", corresponding to a physical scale of ~ 60/i~^kpc. 
This shows that GGL has an important model discriminat- 
ing power on those scales. However, the halo model outlined 
here may be inaccurate on exactly those scales as it does not 
include the effect of lenses hosting ind ividual haloes inside 
their parent halo fe heth fc Ja"inll2003l ). Unfortunately, the 
statistical uncertainties in our GGL measurements do not 
allow to fully exploit this potential. 



7 SUMMARY 

For this paper we studied the clustering and, in particular, 
the relative clustering of red sequence galaxies and blue- 
cloud galaxies in COMBO-17 (fields: Sll, A901, CDFS) 
inside four redshift bins up to a redshift of 2 ~ 1. The two 
samples were separated by applying a redshift-dependent 
cut along the red-sequence. An additional cut was applied 
to assure that red and blue galaxies at all redshifts have 
roughly the same colour-dependent Afv-hmits. The red 
sample has (My) ~ —20.0 ± 0.1 mag, the blue sample has 
(Mv) = -18.8 ± 0.1 mag {h = 1.0). 

By looking at the spatial correlation function of the 
samples, we found for all redshifts combined a corre- 
lation length of ro = 5.5±0.9,3.0±0.4/i"^Mpc for the 
red and blue galaxies, respectively. The corresponding 
power-law indices of the spatial correlation function were 
S = 0.85 ± 0.10, 0.65 ± 0.08. A significant evolution of these 
parameters with redshift was not found. 

Parameterising the relative biasing of the red and blue 
sample in terms of the linear stochastic bias, we measured 
for all redshifts combined that the bias factor, 6(^ap), is 
slightly scale-dependent within a range of aperture radii of 
0^,p = 6"- 20', varying between 6 ~ 1.7 - 2.2. We found 
that the second parameter, r{9a,p) - quantifying the corre- 
lation of galaxy number density fluctuations as function of 
scale -, is scale-dependent, too. It drops from a value close 
to unity at larger scales of 9a.p ~ 20' to r ~ 0.6 ± 0.15 at 
dap ~ 6". An aperture radius of ^ap ~ 10' corresponds to a 
proper spatial scale of 2.8, 3.8, 4.5, 4.8 /i~^Mpc at the red- 
shifts z = 0.3, 0.5, 0.7, 0.9, respectively. The measurements 
emphasise the different clustering of the red and blue sam- 
ple at all redshifts, but do not exhibit a clear evolution with 
redshift beyond the statistical uncertainties. 

We also looked at the mean tangential ellipticity of 
a population of faint background galaxies as function of 
separation from red and blue galaxies (GGL). For this 
analysis, shear catalogues with a mean source redshift of 
z = 0.78 from the GaBoDS were taken. The GGL-signal 
detected corresponds to a projected differential surface 
mass density of AE = 35 ± 25 /iMqpc"^, red galaxies, and 
AS = 16 ± IO/iMqpc"'^, blue galaxies, at a galaxy-galaxy 
separation of ~ 12" (roughly ~ 60/i~^kpc). This indicates 
that the red galaxies are either typically more massive than 
blue galaxies, or are residing inside a matter richer environ- 
ment than the blue galaxies. 

A large part of the paper discussed a dark-matter halo 
based model that was employed to describe the angular clus- 
tering, including the cross-correlation function of clustering, 
the GGL-signal and the (ratio of) number densities of the 
red and blue galaxy sample simultaneously. Due to large sta- 



tistical errors for the GGL, the GGL-signal mainly served 
as an upper limit for a model predicted signal. 

We used three different variants of our halo-model, all 
having the same number of free parameters, to fit the data: 
i) neither the red nor the blue population have central galax- 
ies, ii) only the red sample has central galaxies, and iii) 
both the red and blue sample can have central galaxies and 
there is always one central galaxy. A Bayesian method of 
model discrimination was performed to decide which model 
at which redshift may be most suitable in explaining the 
data. The model discrimination was inconclusive, a more 
accurate treatment or a larger data set is required. In this 
context, we pointed out that GGL at small separations is 
most sensitive to the presence or absence of a central galaxy. 

Describing the cross-correlation function required the 
extension of the halo-model descriptions currently available 
in the lit e ratur e by a t least one additional parameter (see 
IScrantonI l|2003l . I2OO2I ) which is equivalent to our approach 
only if R{m) = 0). The extension is necessary for a full 
parametrisation of the 2"'^-order joint HOD of two galaxy 
populations. We called this parameter the correlation factor 
of the joint HOD of two galaxy populations, R{m). It ex- 
presses the tendency of different galaxy types to avoid {R{m) 
close to minus unity) or attract each other {R{m) close to 
plus unity) inside/into a same dark matter halo of mass m. 
A vanishing R(m) indicates uncorrelated halo-occupation 
numbers. The 2"''-order cross-correlation function of two 
galaxy samples, or equivalently r(&ap), is most sensitive to 
R{m) for small separations where the one-halo term is dom- 
inating (6'ap < 20' for z = 0.3). In principle, R{m) is halo- 
mass dependent but its impact on the cross-correlation func- 
tion becomes negligible for haloes with large galaxy occu- 
pation numbers (large m). Therefore, observations mainly 
constrain R{m) for smaller haloes. The measurements of 
R for the individual 2-bins yielded only little constraints. 
Combining all redshifts we found R = -1-0.5 ± 0.2 (positive 
correlation of galaxy numbers) for iii) and R — +0.1 ± 0.2 
for i) and ii) (no correlation). 

We found it necessary to add another new degree of 
freedom. A, to the model in order to explain the observed, 
towards smaller scales declining sub-unity bias parameter 
r(6'ap). The parameter A regulates how quickly a HOD of 
galaxy numbers populating a dark-matter halo assumes a 
Poisson variance, see Sect. 16.2.21 for an explanation. We 
demonstrated that the cross-correlation function of galaxy 
clustering is quite sensitive to A: r{6a,p) seems to reach val- 
ues substantially smaller than unity only for A > 1. With 
all models i)-iii) we decisively excluded, for red and blue 
galaxies, a value of A < 2 (red) and A < 3 (blue) with 
95% confidence. This rules out with high confidence, at least 
for our red and blue galaxy samples, a model like that of 
iKravtsov et al.l (|2004l ) ("Poisson satellites") for the mean 
number of galaxy pairs inside a halo. 

The halo model parameters of the red and blue sample 
mostly differ for mmin. We concluded that the average of the 
mass-scale for 2 = ... 1 is mmin = 10'^^'^^'''^/i~^Mq for red 
and mmin ~ lO^^'^*'''^ft~^M0 for blue galaxies. 

Finally, we presented evidence that red galaxies are 
more concentrated towards the halo centre than the dark 
matter, which either has to be achieved by a central red 
galaxy or a larger concentration parameter /red = 1.9 ± 0.5. 
The distribution of blue galaxies is consistent with the dark 
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matter distribution inside a halo for i), but flatter with 
/blue = 0.6 ± 0.3 for ii) or iii). 

As an outlook, we would like to mention that the joint 
HOD of two galaxy populations is also probed by a new 
statistics, third-order galaxy-galaxy le nsing (red and blue 

jalaxi es eis lenses), that are outlined in ISchneider fc Watts! 

2005f) and have bee n recently detected for the first time 



Simon et al.l (|2008l ). Combining those statistics with the 



second-order cross-correlation function in forthcoming sur- 
veys promises to set better constraints on the JHOD corre- 
lation factor. 
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Table 5. Compilation of MCMC results for the halo model parameters (see Section 16.21 1. obtained from fitting three different model scenarios to the A/'-statistics, relative galaxy 
numbers and GGL-signal. The three scenarios are denoted by "S" (simple, no central galaxies), "M" (mixed, only red central galaxies) and "C" (central, red and blue central galaxies), r-* 
respectively. The parameters of mmin and m' are in units of solar masses, /i~^Mq, the parameter R represents the JHOD correlation factor between the red and blue sample. Quoted 
parameter values stand for median and Icr confidence limits as derived from the MCMCs. The last block of values at the bottom of the table contains constraints (mean and r.m.s. 
variance) from all redshifts combined. The fiducial cosmological model uses WMAP3 parameters. 
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APPENDIX A: DARK MATTER HALO 
PROPERTIES OF MODEL 

For n(m), we use the semi- analytical peak-background split 
prescription of ISh eth fc Tor mcn (199^): 



n(m) oc 



liidln 



dlnm 

= 0.707 and p 



with V = 5c / {D{z)a{m)) , a 
pression 

2 [sin (rfc) — rfccos {rk)^ 



cr(m) = 



9 r 



Akk^ 



{rk)e 



(Al) 
0.3. The ex- 

Plin(fc) (A2) 



denotes the top-hat variance of the linear dark matter fluc- 
tuations, within spheres of radius r = ( 1 . The 

function D{z) denotes t he linear grow th factor, to be nor- 
malised to one for z = (|Peebleslll98"ol ): 



D{z) oc H{a{z)) 



(1+^)" 



da 7 



(A3) 



[a//(a)]3 ' 

where a{z) = (1 -I- z)~^ is the cosmological scale factor 
and H{a) the Hubble parameter. The constant Sc, the over- 
density of virialised haloes undergoing linear spherical col- 
lapse at 2 = 0, is chosen according to .Nakamura fc Sutol 
l| 19971 ): 



Sc 



3(127r) 



2/3 



20 



[14-0.0123 login fin 



(A4) 



For the cosmology adopted in this paper, one finds 
Sc = 1.674. For the linear dark matter power spectrum the 
prescription of lEisenstein fc Hiil ()l998h is employed. It is 
evolved to redshift z using the linear growth factor D{z). 

Th e halo bias fu n ction we implemented into our model 
is from [Tinker et al.l (|2005l') wh i ch is a modification of the 
original function of Sheth et al.l l|200ll) : 



b{m) = 14- 



VaSc 



(A5) 



(av^Y + h{l - c)(l - c/2)_ 

with a = 0.707, b = 0.35, c = 0.8 and u as previously 
defined. To guarantee that, when using this prescription 
for b{m), the two-halo term asymptotically fits the linear 
dark matter power spectrum Piin(fc) on large-scales (small 
k) we artificially normali se all Pi^{k) in Eq. (|40|) by (see e.g. 
iMandelbaum et al.|[2005l ) 

1 2 



m J 



dmn{m) b{m) 



(A6) 



This normalisation is 6 = 1.02, 1.08 for z = 0.0, 1.0. 

The dark matter density profile of haloes is in our model 
a truncated NFW (Navarro ct al. 199^): 



Mo(r, m) oc \ 



with Ts = X with Tvi, 



for r ^ Tvir 
otherwise 

3m 



(A7) 



-. n — X — 7-^ and c(m, z) be- 

ing the so-called viral radius and concentration parameter, 
resp ectively. For the ove r-density within the virial radius we 
use (jBuUock et al.ll200ll l: 



Avir(2) 



IStt^ 4- 82{flm{z) - 1) - 39(nm(z) - 1)^ 

r2m(z) 



(A8) 



Pcrit (at redshift z = 0) and Q,in{z) are the critical mat- 
ter density and matter density parameter, respectively; 
fim = f^m(z = 0) . The con centration parameter is calculated 
according to lSehakI (|2000l '): 



c(m, z) 



10 

l + z 



m 



(A9) 



where the present day non-linear mass scale, m,, is de- 
fined by Sc — (T(m*). For our adopted cosmology we find 
m. = 1.9 X lO^^h-^Ma. 



APPENDIX B: EXACT STATISTICAL 
MOMENTS FOR CENTRAL GALAXY MODELS 

As can be seen in Table O in a scenario where we split the 
halo occupation number of one population (mixed scenario) 
or both populations (pure central scenario) into one central 
galaxy and satellite galaxies one has to specify (N^^^ N^'^'^) , 
(A^r'A^r') or (Nf^'^Nj), {Af™iVj) as function of halo mass 
m, respectively. Here, we give general expressions in terms 
of the JHOD, P{Ni, Nj\m), of these moments. A detailed 
derivation for one of the moments is given, the others can 
be obtained in a similar manner. Note that we skip the ar- 
guments "(m)" and "|m" in the JHOD for the derivation; 
all following equations are for haloes of a fixed mass. 

The central galaxy-satellite splitting is done in such a 
way that we always have A^'^'^" = 1 if iV > 1, A/'""' = for 



N = 0, iV= 



TV - 1 for iV > 1 and A^^"^' = for A^ 1. 



This is to say that (H(x) is the Heaviside step function) 



Nr = H (AT, - 1) (N, 
AT" = H (TV, - 1) . 



1) 



(Bl) 
(B2) 



We focus on (TVf'^^TV™"). Let us substitute the previous 
expressions and write down explicitly the ensemble average 
in the statistical moment as sum over the JHOD (the states, 
halo occupation numbers, are integers), taking into account 
the Heaviside step functions: 



oo oo 



= E E {N,-i)P{N.,N,) 

Ni=0 Nj=0 
oo 

-EE {N^-1)P{N,,N,) 

oo 

-EE iN,^l)PiN,,N,) 



+ E E {N^-1)P{N.,N,) 

oo 
JVi=0 

oo 

+ E^wi^» = o)- E PiN^'N,) 

Nj=0 Ni,Nj=0 

= (a,J> - 1 - (^N,\N, = o) + PiN, = 0) 
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+ P{N^ = 0) - P{N, = 0, Nj = 0) 

= (n^) -1- {n^\N, = 0) + P{N,Nj = 0) , (B3) 



(B4) 



where we denote by 



N^\Nj = O) = ^ N,P{Ni,Nj = 0) 

JVi=0 



the conditional halo-average of A''^ for which A'j — 0, whereas 
P{NiNj = 0) denotes the probability to find a halo which 
has either Ni = or Nj — 0. The conditional mean is 
zero, if all haloes have at least one j-galaxy. We have used 
in the foregoing calculus the relations (also with i and j 
interchanged) 



pm = Yl p(N^'N^) 



(B5) 



P{N,N,=0) = PiN, = 0) + PiN,=0) - PiN, = 0,Nj=0) 



and 





^ ^ N,P{N.,N, 



(B6) 



iVi=0 iV,-=0 



Analogues to this little exercise in probability theory, 
we find: 



^iVr*A^f') = (B7) 

= [{N^) - 1] [{n,) R^MN.MNj) 

+ (^N,\N, = o) + (^Nj\N, = o) - PiN.Nj = 0) , 
and for the mixed scenario: 

(iVr TV,) = (tV,) - (iV, 17V, = O) , (B8) 
and 

^iVr'iVj) = (B9) 

[(iv,) - 1] (iv,) + R^Mm^iNj) + {nan, = o) . 

In conclusion, we find that the statistical moments for 
the 2"'^-order clustering statistics are surprisingly complex 
in a central galaxy scenario. Only in very simple cases, such 
as no correlation between distinct galaxy populations, i.e. 
for the pure central scenario 



and 



(BIO) 
(Bll) 



we find simple statistical cross-moments. Another trivial 
case is given, if the haloes, for a fixed mass, always contain 
"i"- and "j"-galaxies: the conditional means and P{NiNj = 
0) in the foregoing equations vanish. Simply providing the 
fluctuations in the satellite number, the mean number of 
satellites and central galaxies, and the JHOD correlation 
factor, Rij, does not suffice in general, though. We have to 
make additional assumptions about the JHOD in order to 
work out all factorial moments of second order. 



APPENDIX C: ESTIMATION OF BAYESIAN 
EVIDENCE 

In applying Bayesian statistics to the problem of model fit- 
ting (MacKav 2003) one assesses the posterior probability 
function of the model parameters given the observed data, 
d: 

P^pld) = ^^'^^^^f^^^^ cc P{d\p)P(p) ; (CI) 

P{p) is a prior on the model parameters, possibly obtained 
from other observations, or motivated theoretically. P[d\p) 
is the probability of the data given a set of model parameters 
(likelihood function); this involves essentially the specifica- 
tion of a PDF for the noise in the observation. In this paper, 
we tackle the problem of estimating the posterior likelihood, 
P{p\d), by using the MCMC technique. 

For the mere constraints in model parameter space the 
probability of the observed data, P{d), is unimportant be- 
cause it is just some constant. It can be used in statistics, 
however, to discriminate between a set of models trying to 
explain the data simultaneously, since P{d) is actually the 
probability to obtain the data under the assumption that a 
certain model is correct and taking into account the remain- 
ing uncertainties in p. As PDFs are normalised to one, one 
finds for a particular model that 



P(d) = I dpP{d\p)Pip) 



(C2) 



where the integral is over the full parameter space. 

Evaluating this integral requires, compared to the pos - 
terior, its own way of treatment (e.g. iHeavens et al.ll2007l ). 
We will not go into that h ere. Instead, we are using the ap- 
proximation mentioned in iMacKavl l|2003l ) , by saying that 
the posterior likelihood around the maximum-likelihood 
point can satisfyingly well described by a multivariate Gaus- 
sian; the inverse covariance of the parameter estimates - the 
Hessian of the (log)po sterior - at this point is the Fisher 
matrix /MacKavl bOol . F, which can be estimated from the 
MCMC tracks: 



In E = In P{d) 



(C3) 



= [Indet (27rC) +x + Indet (F/27r) - 21nP(p^i) 



where P{p^i) is the prior likelihood of the maximum- 
likelihood parameters p^^ of the model, C is the covari- 
ance of the data noise (Gaussian) and is evaluated at the 
maximum- likelihood point. By lni5 we mean the Bayesian 
evidence of the data for a given model. 

A model "A" with larger In i5a is more likely to re- 
produce the observed data than a model "B" with smaller 
evidence In iJt,. Depending on the ratio of evidence probabil- 
ities, or difference Aln_E = In Ea — InEt, we would favour 
model "A" ov er model "B" ( A In i? = 1 is a relative proba- 
bility of 37%). |jeffrevl (f 196ll ) suggests that the difference of 
two models is "substantial" for 1 ^ AlniS < 2.5, "strong" 
for 2.5 Aln_E < 5 and "decisive" otherwise. 

This paper has been typeset from a TpjX/ KTjtjX file prepared 
by the author. 



